Hi 


I 


i 


THE  READABILITY  OF 
CERTAIN  TYPE  SIZES  AND  FORMS 
IN  SIGHT-SAVING  CLASSES 


By  HAROLD  J.  McNALLY,  Ph.D. 


TEACHERS  COLLEGE,  COLUMBIA  UNIVERSITY 
CONTRIBUTIONS  TO  EDUCATION,  NO.  883 


^Published  with  the  Approval  of  Professors  Arthur  I.  Gates 
and  Robert  L.  Thorndike,  Co-sponsors 


bureau  of  Vublications 

TEACHERS  COLLEGE,  COLUMBIA  UNIVERSITY 

NEW  YORK  1943 


COPYRIGHT,  1943,  BY  HAROLD  J.  MCNALLY 


fims  7 

/ii 


TYPOGRAPHY,  PRINTING,  AND  BINDING  IN  THE  U.  S.  A.  BY 
KINGSPORT  PRESS,  INC.,  KINGSPORT,  TENNESSEE 


ACKNOWLEDGMENTS 


The  prosecution  of  a  study  such  as  this  necessitates  the  assistance 
and  cooperation  of  a  great  many  persons.  Among  the  many  to 
whom  I  am  deeply  indebted,  particular  thanks  are  due: 

Dr.  Arthur  I.  Gates,  co-sponsor,  who  will  understand  that  my 
gratitude  exceeds  my  vocabulary; 

Dr.  Robert  L.  Thorndike,  co-sponsor,  who  gave  friendly  en¬ 
couragement  and  invaluable  assistance  at  all  stages  of  the  research; 

Dr.  Harold  W.  McCormick,  Director  of  the  Survey  of  the 
Study  of  the  Care  and  Education  of  the  Physically  Handicapped 
in  the  City  of  New  York,  who  made  necessary  administrative 
arrangements,  and  gave  valuable  advice; 

The  members  of  the  Subcommittee  on  Readability  of  the  Com¬ 
mittee  for  the  Study  of  the  Care  and  Education  of  Physically 
Handicapped  Children  in  the  Public  Schools  of  the  City  of  New 
York,  for  valuable  criticism  and  suggestions; 

The  Mergenthaler  Linotype  Company,  for  the  loan  of  es¬ 
sential  matrices  and  other  materials; 

The  A.  B.  Dick  Company,  manufacturers  of  Mimeograph 
Products,  for  supplying  the  stencils  and  doing  the  Mimeography 
necessary  in  the  experiment; 

And  others,  too  numerous  to  mention  here,  who  were  of  real 
assistance  in  the  conduct  of  the  study. 


H.  J.  MCN. 


;; 


TABLE  OF  CONTENTS 


CHAPTER  PAGE 

I.  THE  PROBLEM:  ITS  ORIGIN  AND  DEFINITION  .  1 

Importance  of  Sight-Saving  Classes . 1 

Need  for  Objective  Evidence  on  the  Readability  of  Type 

Sizes  in  Sight-Saving  Classes . 1 

Selection  of  Types  to  Be  Studied . 5 

Definition  of  Readability . 8 

Summary . 8 

II.  SELECTION  OF  THE  CRITERIA . 10 

Difficulty  of  Locating  Satisfactory  Criteria  .  .  .  .  10 

Methods  Used  in  Other  Studies . 11 

Criteria  to  Be  Used  in  This  Study . 18 

Summary . 19 

III.  CONDITIONS  OF  THE  EXPERIMENT . 20 

Length  of  the  Sampling  Periods . 20 

Selection  of  the  Reading  Tests  to  Be  Used . 21 

Typographical  Conditions  of  the  Tests . 24 

The  Paper  Used  for  Printing  the  Tests . 28 

Subjects . 30 

Testing  Conditions  and  Methods . 36 

Summary . 38 

IV.  EXPERIMENTAL  DESIGN  AND  STATISTICAL 

METHOD . 39 

Importance  of  Experimental  Design . 39 

Design  of  the  Experiment . 41 

Analysis  of  the  Data . 42 

Summary . 52 

V.  ANALYSIS  AND  INTERPRETATION  OF  THE 

RESULTS . 54 

Possible  Interpretations  of  the  Results . 54 

Evaluation  of  Possible  Interpretations . 55 


VI 


TABLE  OF  CONTENTS 


CHAPTER  PAGE 

VI.  SUMMARY  AND  CONCLUSIONS . 60 

The  Problem . 60 

The  Criteria . 60 

The  Tests . 60 

Paper . 61 

The  Subjects . 61 

Administration  of  the  Tests . 62 

Methods  of  Analysis  of  the  Data . 62 

Results . 63 

Interpretation  of  the  Results . 64 

Conclusions . 65 

BIBLIOGRAPHY . 67 


THE  READABILITY  OF 
CERTAIN  TYPE  SIZES  AND  FORMS 
IN  SIGHT-SAVING  CLASSES 


$ 

t 


I 


THE  PROBLEM:  ITS  ORIGIN 
AND  DEFINITION 


IMPORTANCE  OF  SIGHT-SAVING  CLASSES 

An  education  suited  to  the  needs  and  limitations  of  atypical  chil¬ 
dren  is  an  obligation  of  a  democratic  society.  Prominent  in  num¬ 
bers  among  such  children  are  those  with  only  partial  vision.  In 
1931,  the  White  House  Conference  stated  that  a  reasonable  esti¬ 
mate  of  the  number  of  such  children  in  the  United  States  needing 
special  educational  provision  was  approximately  one  in  every 
500,  or  something  over  50,000  children.^  Many  of  these  are 
already  in  sight-saving  classes,  of  which  there  are  now  635  in  the 
United  States.  These  classes  represent  227  cities,  31  states,  the 
District  of  Columbia,  and  the  territory  of  Hawaii.^  In  New 
York  City  alone  there  are  88  schools  with  sight-saving  classes,® 
enrolling  approximately  2,000  pupils.  In  other  words,  this  is  a 
sizable  and  important  undertaking.  Any  attempt  to  determine 
experimentally  the  best  methods  or  materials  for  use  in  these 
classes  is  of  real  practical  value. 

NEED  FOR  OBJECTIVE  EVIDENCE  ON  THE  READABILITY 
OF  TYPE  SIZES  IN  SIGHT-SAVING  CLASSES 

One  of  the  problems  encountered  in  providing  a  sight-con- 

^  White  House  Conference,  Special  Education:  The  Handicapped  and  the 
Gifted,  p.  115.  The  Century  Co.,  New  York,  1931. 

2  These  figures  were  supplied  by  Mrs.  Winifred  Hathaway,  Associate  Di¬ 
rector  of  the  National  Society  for  the  Prevention  of  Blindness,  in  a  letter  dated 
January  19,  1942. 

^  Committee  for  the  Study  of  the  Care  and  Education  of  Physically  Handi¬ 
capped  Children  in  the  Public  Schools  of  the  City  of  New  York,  Report  of  the 
Reading  Subcommittee  for  the  Visually  Handicapped,  p.  3.  New  York  City 
Board  of  Education,  March,  1940. 
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serving  program  of  education  for  partially  sighted  pupils  is  that 
of  recommending  a  size  of  type  which  is  a  practical  optimum  for 
them  with  respect  to  readability.  This  study  was  designed  to 
attack  that  problem  by  attempting  to  evaluate  the  readability  of 
certain  types  and  type  sizes  for  children  with  handicapped  vision 
in  the  sight-saving  classes  of  New  York  City. 

The  need  for  studies  of  this  kind  has  been  widely  recognized 
among  those  directly  responsible  for  the  education  of  partially 
sighted  pupils.  At  present,  the  reading  materials  used  in  sight¬ 
saving  classes  are  commonly  printed  in  extra-large  type.  Gen¬ 
erally  this  is  “Clear  Type,”  a  widely  spaced  and  leaded  Caslon 
Bold  type  face,  approximately  6.25  mm.  in  height  for  capitals 
and  lower  case  letters  with  ascenders  or  descenders.  Not  every¬ 
thing  the  pupils  read  is  available  in  this  type  size,  however,  and 
teachers  commonly  meet  this  lack  by  enlarging  reading  matter 
with  the  use  of  special  typewriters  with  large  type.  The  capitals 
and  full  height  letters  on  these  typewriters  range  from  5  mm.  to 
5.5  mm.  in  height;  the  lower  case  letters  are  from  3.5  mm.  to 
about  4  mm.  in  height.  These  heights  are  uniform  on  any  given 
machine,  of  course,  varying  with  the  make  of  the  typewriter.  In 
some  classes,  stencils  are  cut  on  these  typewriters,  and  are  used 
to  duplicate  reading  matter  with  a  machine  duplicator.  In  ad¬ 
dition,  many  of  the  teachers  spend  large  amounts  of  time  repro¬ 
ducing  reading  matter  in  large  lettering  by  hand. 

A  search  for  objective  evidence  to  support  these  practices, 
however,  revealed  practically  none  whatever.  Studies  of  the 
readability  of  type  sizes  with  reference  to  persons  of  normal  or 
near-normal  vision  are  reported,  but  almost  nothing  appears  with 
reference  to  any  of  the  aspects  of  readability  for  the  partially 
sighted.  Indeed,  the  only  published  study  of  this  kind  that  this 
investigator  found  was  one  by  Luckiesh  and  Moss,  and  that  was 
concerned  primarily  with  the  static  factor  of  visibility  of  18- 
point  and  24-point  type."^ 

*  Matthew  Luckiesh  and  Frank  Moss,  The  Science  of  Seeing,  pp.  266-267. 
D.  Van  Nostrand  Co.,  New  York,  1937. 
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The  basis  for  the  present  use  of  24-point  type  in  these  classes 
was  provided  by  a  series  of  studies  made  by  Robert  B.  Irwin  in 
1920.®  These  investigations  were  carried  out  with  pupils  in 
several  cities,  chiefly  in  Cleveland,  and  included  types  ranging 
from  10-point  to  36-point  in  size.  The  study  upon  which  most 
of  the  ultimate  conclusions  were  based  used  44  words  “chosen 
with  special  reference  to  letters  such  as  o,  e,  a,  m,  n,  b,  d,  which 
might  readily  be  mistaken  by  a  child  with  poor  sight,  if  the  type 
was  illegible  to  him;  or  to  such  combinations  as  lo,  li,  bi,  fi,  which 
must  be  seen  clearly  to  be  read  correctly.”  ®  The  criteria  were 
time,  errors,  average  reading  distance,  and  apparent  effort. 

Irwin  concluded  that,  when  other  factors  such  as  leading  and 
letter  spacing  are  optimum,  “a  24-point  type  is  preferred  by  most 
(visually  handicapped)  children.”  ^  In  an  attempt  to  ascertain  the 
effect  of  eye  condition  upon  type  preference,  the  data  were 
analyzed  according  to  several  visual  defects,  and  the  difference 
in  preferences  analyzed.  The  conclusion  was  that,  “The  same 
type  .  .  .  which  is  most  legible  for  children  in  one  group  is  also 
most  legible  for  the  majority  in  the  other  groups.”  ® 

Other  more  recent  investigators  and  authorities,  including 
many  of  those  supervising  or  teaching  in  these  classes,  have 
raised  the  question  of  whether  24-point  type  is  the  best  type  for 
all  partially-sighted  children.  In  reporting  a  study  of  the  visi¬ 
bility  of  18-point  and  24-point  types  (type  face  unspecified), 
Luckiesh  and  Moss  make  the  following  statement: 

For  the  cases  involving  medium  myopia,  .  .  .  subjects  obtained 
visibility  values  nearly  as  high  as  did  the  group  of  adults  possessing 
normal  vision.  Thus  for  them  the  24-point  type  was  probably  un¬ 
necessarily  large.  .  .  .  On  the  other  hand,  the  pupils  with  cataracts 
obtained  a  visibility  of  only  2.60  and  4.31  when  observing  18-point 
and  24-point  type  respectively.  ...  In  other  words,  18-point  and 
24-point  types  are  no  more  visible  to  the  pupils  with  cataracts  than 
are  6-point  and  9-point  types  for  adults  with  normal  vision.  Hence, 

®  White  House  Conference,  op.  cit.,  p.  144. 

®  Robert  B.  Irwin,  “Large  Type  Reading  Tests,”  p.  4.  (Unpublished  report 
of  the  studies  discussed.)  American  Foundation  for  the  Blind,  Inc.,  New  York 
Ibid.,  p.  10.  ^  Ibid. 
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it  follows  that  these  pupils  need  the  24-point  type  in  order  to  ob¬ 
tain  a  reasonable  degree  of  visibility.^ 

The  report  of  a  subcommittee  of  the  Committee  for  the  Study 
of  the  Care  and  Education  of  Physically  Handicapped  in  the 
Public  Schools  of  the  City  of  New  York  states  that: 

Present  research  reports  that  deal  with  written  materials,  lighting, 
and  the  physical  conditions  of  reading  for  partially-sighted  pupils 
are  controversial.  Further  studies  should  be  made  to  determine  stand¬ 
ards  for  these  pupils.  .  .  .  These  studies  should  investigate  size, 
kind,  color,  and  ink-density  of  type,  spacing  of  letters,  words  and 
lines,  lighting,  and  other  physical  conditions  important  in  reading. 
The  relative  readability  of  materials  produced  by  mimeograph,  hec¬ 
tograph,  typewriter,  photo-offset,  multigraph,  and  print,  should  be 
considered,  including  both  teacher-made  and  commercial  materials. 
.  .  .  Recommendations  for  desirable  materials  and  conditions  of 
reading  in  these  classes  in  the  future  depend  upon  the  results  of  such 
studies.^^^ 

The  Subcommittee  on  the  Blind  and  Partially  Seeing  made  the 
following  statement  in  their  section  of  the  White  House  Con¬ 
ference  report: 

If  further  investigation  should  show  that  for  many  sight-saving  class 
pupils  18-point  type  would  be  quite  as  satisfactory  as  24-point,  it 
would  be  of  great  help  in  solving  the  problem  of  having  a  sufficient 
number  of  books,  since  many  modern  libraries  are  now  specializing 
in  “books  for  tired  eyes.”  The  demand  for  these  from  a  much  wider 
group  of  people  than  the  limited  number  in  sight-saving  classes 
would  appreciably  lower  the  cost  of  material,  and  remove  one  of  the 
chief  objections  to  the  establishment  of  sight-saving  classes.^^ 

It  seems  evident,  therefore,  that  there  is  a  felt  need  for  experi¬ 
mentation  to  gather  more  evidence  on  the  readability  of  type 
sizes  in  such  classes.  Such  studies  might  support  any  one  of  the 
following  possibilities: 

1.  Present  usage  (24-point  type  and  enlarged  typed  and  dupli¬ 
cated  material)  makes  for  the  greatest  readability  with 
sight-saving  pupils. 

®  Luckiesh  and  Moss,  op.  cit.,  pp.  266-267. 

Committee  for  the  Study  of  the  Care  and  Education  of  Physically  Handi¬ 
capped  Children  in  the  Public  Schools  of  the  City  of  New  York,  op.  cit.,  p.  56. 

White  House  Conference,  op.  cit.,  p.  144. 
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2.  Some  type  (or  types)  smaller  than  24-point  is  more  read¬ 
able  for  these  pupils. 

3.  Some  type  smaller  than  24-point  is  equally  readable  for 
these  pupils. 

4.  Type  sizes  vary  in  their  readability  for  these  pupils,  de¬ 
pending  upon  the  kind  of  visual  aberration. 

5.  24-point  type  is  too  small  for  these  pupils. 

6.  Typewritten  and/or  machine-duplicated  materials  (of  the 
varieties  investigated)  are  as  readable  for  the  partially 
sighted  as  is  printed  material,  or  at  least  are  acceptable  for 
use  with  such  pupils. 

7.  Typewritten  and/or  machine-duplicated  materials  (of  the 
varieties  investigated)  are  not  acceptable  for  use  with  the 
partially  sighted  from  the  standpoint  of  readability. 

SELECTION  OF  TYPES  TO  BE  STUDIED 

Type  size 

With  such  a  range  of  possible  outcomes,  and  such  a  variety  of 
printing  conditions  to  be  studied,  the  immediate  concern  in  the 
formulation  of  the  problem  was  the  selection  of  type  sizes,  type 
style,  and  method  of  reproduction  of  reading  matter  (printing, 
typing,  Mimeography,  etc.)  to  be  investigated.  Aside  from  the 
unlikelihood  of  their  superiority,  type  sizes  larger  than  24-point 
are  rendered  quite  impractical  because  of  their  printing  costs. 
Furthermore,  some  of  the  evidence  points  to  the  possibility  that 
type  sizes  smaller  than  24-point  might  be  acceptable — if  not  pref¬ 
erable — for  many  partially  sighted  children.^^  For  these  reasons, 
24-point  type  was  decided  upon  as  the  largest  type  to  be  studied. 

Other  reasons  led  to  the  decision  to  use  12-point  type  as  the 
lower  limit  of  type  size  to  be  studied.  Most  important  in  this  de¬ 
cision  was  the  unwillingness  o£  the  authorities  in  charge  of  the 
classes  in  which  the  study  was  to  be  made  to  allow  the  pupils  to 
read  type  smaller  than  this.  Then,  too,  research  studies  of  the 
readability  of  type  sizes  for  persons  with  normal  vision  seem  to 

^^Luckiesh,  op.  cit.,  pp.  266-267. 
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indicate  that  the  type  size  which  is  optimum  for  such  persons  is 
10-point  type/^  If  this  is  true,  it  appears  reasonable  to  assume  that 
10-point  type  is  too  small  for  optimum  readability  in  the  case  of 
children  with  handicapped  vision. 

To  provide  for  intermediate  gradations  between  these  two 
rather  arbitrary  extremes  of  12-point  and  24-point  type,  it  was 
decided  to  test  the  readability  of  14-point  and  18-point  type  as 
well.  In  the  event  of  significant  differences  in  the  readability  of 
the  type  sizes  studied,  this  would  provide  some  basis  for  the  de¬ 
termination  of  the  type  size  which  is  optimum  from  the  stand¬ 
point  of  the  increments  in  readability  as  weighed  against  the 
practicalities  of  printing  expense.  In  other  words,  if  it  were  to  be 
found  that  24-point  is  the  more  readable,  but  that  it  is  only  very 
slightly  more  so  than  18-point,  the  large  savings  effected  by 
printing  in  18-point  type  might  well  outweigh  the  slight  advan¬ 
tage  in  readability  held  by  the  larger  type. 

Style  of  type 

As  this  was  to  be  a  study  of  the  readability  of  type  sizes,  it  was 
desirable,  of  course,  to  keep  the  style  of  type  as  uniform  as  pos¬ 
sible.  Furthermore,  it  was  deemed  wise  in  this  study  to  approxi¬ 
mate  as  closely  as  possible  the  actual  conditions  in  sight-saving 
classrooms.  As  has  been  mentioned,  the  type  face  commonly 
used  in  these  classes  is  “Clear  Type.”  This  is  practically  the  same 
as  Linotype  Caslon  No.  3  and  Monotype  Caslon  Bold  No.  79. 
The  12-point  and  14-point  type  sizes,  therefore,  were  set  in  Lino¬ 
type  Caslon  No.  3  (a  Caslon  Bold) .  It  was  originally  intended  to 
set  the  18-point  and  24-point  sizes  in  linotype  also,  but  the  prac- 

Everett  Alderman,  “The  Effects  of  Size  of  Type  on  Speed  of  Reading,  and 
the  Determination  of  Various  Factors  That  May  Influence  the  Results,”  Pitts¬ 
burgh  Schools,  13,  No.  2,  1938.  B.  R.  Buckingham,  “New  Data  on  the  Typog¬ 
raphy  of  Textbooks,”  The  Textbook  in  American  Education,  Thirtieth  Year¬ 
book  of  the  National  Society  for  the  Study  of  Education,  Part  II,  pp.  93-125. 
Public  School  Publishing  Co.,  Bloomington,  Ill.,  1931.  M.  Luckiesh  and  F. 
Moss,  “A  Summary  of  Researches  Involving  Blink-Rate  as  a  Criterion  of  Ease 
of  Seeing,”  Illuminating  Engineering,  35:19-32,  1940.  M.  Tinker  and  D.  Pater¬ 
son,  “Studies  of  Typographical  Factors  Influencing  Speed  of  Reading.  II: 
Size  of  Type,”  Journal  of  Applied  Psychology,  13:120-130,  1929. 
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deal  difficulties  encountered  made  this  impossible.  Consequently, 
these  latter  two  sizes  were  set  in  Monotype  Caslon  Bold  No.  79. 
These  two  types  (Linotype  Caslon  No.  3  and  Monotype  Caslon 
Bold  No.  79)  are  identical  to  the  ordinary  observer  and  only  one 
skilled  in  distinguishing  between  type  styles  could  discern  their 
points  of  difference. 

T ypewriter  type  and  Mimeo graph y 

To  approximate  further  the  actual  conditions  in  sight-saving 
classes,  it  was  decided  to  include  both  enlarged  typewritten  and 
machine-duplicated  materials.  The  machine  used  for  typing  was 
a  special  model  built  by  the  Royal  Typewriter  Company, 
equipped  with  a  type  bearing  the  trade  name  of  Ampli-Type. 
This  type  is  1.25  picas  (5  mm.)  in  height  for  capitals  and  for 
lower  case  letters  with  ascenders  and  descenders.  For  the  lower 
case  letters  without  ascenders  or  descenders,  it  is  .875  picas  (3.5 
mm.)  in  height.  All  strokes  have  a  uniform  thickness  of  approxi¬ 
mately  .5  mm.  This  machine  was  used  to  type  test  materials  and 
to  cut  stencils  for  Mimeographing  test  materials  for  the  study. 
Its  use  was  decided  upon  because  machines  of  this  and  similar 
makes  are  used  in  sight-saving  classes  to  reproduce  in  large  type¬ 
writer  type  materials  that  otherwise  would  be  considered  too 
small  for  the  pupils  in  those  classes  to  read. 

Thus,  the  type  faces  decided  upon  for  investigation  were: 

1.  12-point  Caslon  No.  3,  Linotype 

2.  14-point  Caslon  No.  3,  Linotype 

3.  18-point  Caslon  Bold  No.  79,  Monotype 

4.  24-point  Caslon  Bold  No.  79,  Monotype 

5.  Ampli-Type,  typed 

6.  Ampli-Type,  Mimeographed 

DEFINITION  OF  READABILITY 

Throughout  the  foregoing  discussion,  the  term  “readability’’ 
has  been  used  frequently.  In  the  literature  of  types  and  type 

“Mimeograph”  is  the  trade  name  for  the  duplicating  products  of  the  A.  B. 
Dick  Co.,  Chicago  and  New  York. 
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sizes,  this  term  has  sometimes  been  used  interchangeably  with 
“visibility”  and  “legibility.”  It  has  also  been  used  to  denote  that 
quality  which  renders  reading  matter  pleasurable,  appealing,  or 
easy  to  read  from  the  standpoint  of  comprehension.  As  used  in 
this  report,  the  term  “readability”  is  not  synonymous  with  any 
of  these  terms  or  meanings.  It  refers,  rather,  to  the  extent  to 
which  a  given  type  size  or  form  lends  itself  to  being  read  with 
absence  of  visual  effort — with  least  “cost  to  .  .  .  human  beings 
operating  as  human  seeing  machines”  — when  comprehension 
is  held  relatively  constant.  “Visibility,”  on  the  other  hand,  re¬ 
fers  more  to  the  degree  of  clarity  with  which  an  object  can  be 
distinguished,  and  is  a  function  of  various  conditions  which  also 
affect  readability,  including  illumination,  size  of  object,  distance, 
contrast,  etc.  “Legibility”  generally  connotes  the  idea  of  capa¬ 
bility  of  being  recognized,  particularly  with  respect  to  symbolic 
language,  when  readability  conditions  extraneous  to  the  visual 
object  meet  supra-threshold  conditions.  These  two  conceptions 
are  less  dynamic  than  the  meaning  of  readability  herein  em¬ 
ployed.  The  three  terms  are  not  separate  and  distinct,  however; 
they  are  closely  related.  Readability  implies  both  adequate  visi¬ 
bility  and  legibility,  and  is  the  more  inclusive  of  the  three  con¬ 
cepts.  It  is  largely  a  physiological  concept,  and — as  has  been 
mentioned — bears  an  inverse  relationship  to  visual  effort:  the 
greater  the  visual  effort  necessary  to  read  a  given  printed  passage, 
the  less  readable  is  that  passage  considered  to  be.  This  is  the 
definition  of  readability  which  was  employed  in  this  study. 

SUMMARY 

The  study  herein  reported  was  undertaken  because  of  the 
need  for  more  objective  evidence  on  the  relative  readability  of 
several  type  sizes,  and  of  enlarged  typewritten  and  Mimeo¬ 
graphed  materials.  Present  usage,  as  a  result  of  some  not-too- 
rigidly  controlled  studies  made  in  Cleveland  in  1920,  is  usually 
24-point  Caslon  Bold  type  (“Clear  Type”),  and  materials  en- 

Luckiesh  and  Moss,  loc.  cit.,  p.  12. 
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larged  in  various  ways.  The  need  for  further  evidence  has  been 
explicitly  recognized  by  authorities  in  this  field.  The  types  se¬ 
lected  for  evaluation  in  this  study  were  12-,  14-,  18-,  and  24-point 
types,  and  Ampli-Type,  typed  and  Mimeographed. 

For  the  purpose  of  this  study,  “readability”  is  defined  in  terms 
of  ease  of  seeing  or  “cost  to  the  reader”:  the  greater  the  cost,  the 
less  the  readability  and  vice  versa. 


II 


SELECTION  OF  THE  CRITERIA 

DIFFICULTY  OF  LOCATING  SATISFACTORY  CRITERIA 

To  make  any  study  of  readability  as  herein  defined,  it  is  first 
necessary  to  find  some  criterion  to  measure  the  expenditure  of 
visual  effort,  or  amount  of  visual  fatigue — the  “cost  to  the 
reader.”  An  examination  of  the  literature,  however,  failed  to  lo¬ 
cate  a  criterion  for  the  measurement  of  these  factors  which  is 
universally  agreed  upon  as  being  valid,  reliable,  and  sufficiently 
sensitive. 

Inquiries  to  other  likely  sources,  such  as  the  Harvard  Fatigue 
Laboratory,  and  prominent  research  workers  in  physiology  and 
ophthalmology,  convinced  the  investigator  that  no  entirely  satis¬ 
factory  method  of  measuring  visual  fatigue  has  yet  been  de¬ 
veloped.  One  which  seems  to  hold  some  promise — that  of  speed 
of  accommodation  to  near  and  far  vision  ^ — not  only  is  contro¬ 
versial,^  but  presented  two  obstacles.  First,  the  authorities  in 
charge  of  the  classes  in  which  the  study  was  made  were  unwilling 
— and  justifiably  so — to  allow  the  pupils  to  be  subjected  to  any 
prolonged,  fatiguing  visual  task  such  as  would  be  necessary  to 
produce  significant  changes  in  speed  of  accommodation.  Second, 
the  measure  of  such  accommodation  speed  is  a  technical  task,  for 
which  the  investigator  had  neither  the  training  nor  the  equip¬ 
ment. 

^  C.  J.  Robertson,  “Effect  of  Fatigue  on  the  Adjustment  of  the  Eye  to  Near 
and  Far  Vision,”  Archives  of  Ophthalmology,  17:859-876,  1937.  R.  E.  Wager, 
“A  Method  of  Measuring  Fatigue  of  the  Eyes,”  Journal  of  Educational  Psy¬ 
chology,  13:561-570,  1922. 

^  J.  I.  Kurtz,  “An  Experimental  Study  of  Ocular  Fatigue,”  American  Journal 
of  Optometry,  15:86-117,  1938. 
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This  does  not  mean  that  other  criteria  have  not  been  used  with 
success.  A  number  of  investigators  report  studies  of  the  various 
factors  of  readability,  in  which  they  used  criteria  which  are  rela¬ 
tively  easy  to  apply,  and  which  have  given  positive  results  in 
many  cases. 

METHODS  USED  IN  OTHER  STUDIES  OF 
TYPOGRAPHICAL  FACTORS 

Several  approaches  have  been  made  to  the  study  of  typographi¬ 
cal  factors.  In  general,  there  have  been  three  distinct  types  of 
investigations:  (1)  studies  of  visibility,  measured  in  terms  of 
visual  thresholds;  (2)  studies  measuring  readability  by  amount  of 
work  output;  and  (3)  studies  of  readability  defined  as  “ease  of 
seeing”  or  “cost  to  the  reader.”  Studies  of  the  first  two  types  date 
from  late  in  the  nineteenth  century;  studies  of  the  “cost  to  the 
reader”  are  relatively  recent,  and  largely  confined  to  the  investi¬ 
gations  of  Luckiesh  and  Moss. 

Studies  of  visibility 

Prominent  among  the  investigations  emphasizing  the  visibility 
or  legibility  (the  terms  are  used  quite  interchangeably  in  the 
literature)  of  letters  are  those  of  Roethlein,  Tinker  and  Paterson, 
and  Luckiesh  and  Moss,  although  a  number  of  other  investigators 
report  studies  of  this  nature.  The  studies  reported  mention  sev¬ 
eral  methods  of  appraising  this  factor.  All  of  them,  however,  are 
concerned  with  what  Luckiesh  and  Moss  term  “visual  thresh¬ 
olds.”  These  two  investigators  state  that  visual  thresholds  vary 
with  the  purpose  of  the  visual  task,  and  list  the  following  thresh¬ 
old  levels: 

“1.  Maximum  ease  of  seeing,  or  minimum  cost  of  seeing  to  the 
human  seeing  machine. 

2.  Maximum  performance  of  useful  work,  regardless  of  cost  to 
the  human  seeing  machine. 

3.  Maximum  certainty  of  recognition  of  critical  details. 

4.  Minimum  certainty  of  recognition  of  critical  details. 
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5.  Maximum  certainty  of  distinguishing  presence  of  object. 

6.  Minimum  certainty  of  distinguishing  presence  of  object.”  ^ 

Most  studies  of  the  visibility  or  legibility  of  print  are  concerned 
with  levels  3  and  4.  The  studies  reported,  however,  use  different 
methods  of  appraisal. 

Roethlein  used  the  distance  method  to  study  the  legibility  of 
type  faces.  The  types  studied  were  moved  toward  the  observer 
from  a  distance  at  which  they  were  unrecognizable  by  him,  until 
they  could  be  recognized.^  This  method  was  also  used  by  Web¬ 
ster  and  Tinker,®  Taylor,®  Sanford,^  and  Holmes.® 

Burtt  and  Basch  studied  the  legibility  of  the  isolated  letters  of 
several  type  faces  with  the  use  of  Weiss’s  Focal  Variator.®  This 
apparatus  is  a  system  of  lenses  so  related  to  one  another  that  an 
image  may  be  projected  on  a  ground  glass  screen  in  any  degree 
of  clarity,  from  complete  clearness  to  unrecognizable  blur.^®  The 
measure  of  the  legibility  was  the  degree  of  focus  at  which  the 
projected  letter  could  be  recognized. 

Still  another  method  which  has  been  used  is  that  of  controlled 
tachistoscopic  exposure,  wherein  the  legibility  is  measured  by 
the  minimum  length  of  exposure  necessary  for  recognition.^^ 

^  M.  Luckiesh  and  F.  Moss,  The  Science  of  Seeing,  pp.  46-47.  D.  Van 
Nostrand  Co.,  New  York,  1937. 

^  B.  E.  Roethlein,  “The  Relative  Legibility  of  Different  Faces  of  Printing 
Types,”  American  Journal  of  Psychology,  23:1-36,  1912. 

^  H.  M.  Webster  and  M.  A.  Tinker,  “The  Influence  of  Type  Face  on  the 
Legibility  of  Print,”  Journal  of  Applied  Psychology,  19:43-52,  1935,  and  “The 
Influence  of  Paper  Surface  on  the  Legibility  of  Print,”  Journal  of  Applied 
Psychology,  19:145-147,  1935. 

®  C.  D.  Taylor,  “The  Relative  Legibility  of  Black  and  White  Print,”  Journal 
of  Educational  Psychology,  25:561-578,  1934. 

^  E.  C.  Sanford,  “The  Relative  Legibility  of  Small  Letters,”  American  Journal 
of  Psychology,  1:402-435,  1888. 

*  G.  Holmes,  “The  Relative  Legibility  of  Black  and  White  Print,”  Journal 
of  Applied  Psychology,  15:248-251,  1931. 

^  H.  E.  Burtt  and  C.  Basch,  “The  Legibility  of  Bodoni,  Baskerville  Roman, 
and  Cheltenham  Type  Faces,”  Journal  of  Applied  Psychology,  7:237-245,  1923. 

^°A.  P.  Weiss,  “The  Focal  Variator,”  Journal  of  Experimental  Psychology, 
2:106-113,  1917. 

R.  Miyake,  J.  W.  Dunlap,  and  E.  E.  Cureton,  “The  Comparative  Legibility 
of  Black  and  Colored  Numbers  on  Colored  and  Black  Backgrounds,”  Journal  of 
General  Psychology,  3:340-343,  1930. 
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The  most  recent  method  of  studying  type  visibility  is  that  of 
Luckiesh  and  Moss.  They  have  developed  a  Visibility  Meter, 
which  they  describe  as  an  instrument  consisting  “essentially  of 
two  colorless  photographic  filters  with  precise  gradients  in  den¬ 
sity,  which  may  be  rotated  in  front  of  the  eyes  while  looking  at 
an  object  or  performing  a  visual  task.  These  gradient  filters  not 
only  reduce  the  apparent  brightness  of  the  visual  field  due  to 
absorption,  but  also  lower  the  contrast  between  the  object  of 
regard  and  its  background,  due  to  a  scattering  of  light  by  the 
photographic  film.”  The  “gradient  filters”  of  this  instrument 
are  simply  strips  of  photographic  film  ranging  in  density  from 
colorless  transparency  to  virtual  opacity.  The  object  being 
studied  is  viewed  through  these  filters,  with  effects  similar  to 
those  which  would  be  attained  by  looking  through  smoked 
glasses  of  varying  densities.  The  feature  of  the  instrument  is 
that  the  degrees  of  density  have  been  calibrated  and  standardized 
in  terms  of  standard  illumination  intensities  and  visual  test  ob¬ 
jects. 

Using  this  instrument,  it  is  possible  to  get  either  “absolute” 
visibility  ratings  in  terms  of  the  readings  on  the  instrument,  or 
“relative”  visibility  ratings,  expressed  in  per  cent  of  visibility 
when  a  given  condition  (a  given  type  size,  illumination  level, 
type  style,  etc.)  is  taken  as  standard,  or  100  per  cent.  The  read¬ 
ings  can  be  interpreted  in  terms  of  illumination  intensity  neces¬ 
sary  to  equalize  the  visibility  of  test  objects  with  that  of  the 
standard,  or  in  terms  of  changes  in  size  necessary  to  achieve  that 
end. 

The  relative  effects  of  many  of  the  variables  affecting  reada¬ 
bility  have  been  studied  by  these  two  investigators  with  the  use 
of  the  Visibility  Meter. 

M.  Luckiesh  and  F.  Moss,  “The  Visibility  and  Readability  of  Print  on 
White  and  Tinted  Papers,”  Sight-Saving  Review,  8:123-134,  1938.  See  also 
M.  Luckiesh,  and  F.  Moss,  “Supra-Threshold  Visibility,”  Journal  of  the  Optical 
Society,  30:62-69,  1940;  “Visibility  and  Ease  of  Seeing,”  Industrial  Medicine, 
9:33,  1940;  and  “Visibility:  Its  Measurement  and  Significance  in  Seeing.”  Jour¬ 
nal  of  the  Franklin  Institute,  220:431,  1935. 
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Studies  of  work  output  as  a  measure  of  readability 

Some  investigators  have  attempted  a  more  dynamic  approach 
to  the  problem  than  that  which  is  implied  in  the  concept  of 
visibility.  Many  studies  are  reported  wherein  readability  is  de¬ 
termined  by  “work  output,”  which  refers  here  to  the  amount  of 
material  which  can  be  read  in  a  given  time.  The  foremost  ex¬ 
ponents  of  this  technique  are  Tinker  and  Paterson.  These  two 
investigators  studied  not  only  the  readability  of  type  sizes, but 
many  other  factors  affecting  speed  of  reading  as  well.  Short 
speed  tests  with  definite  time  limits  were  used,  and  readability 
was  measured  in  terms  of  the  amount  of  material  read  within 
these  time  limits. 

Speed  of  production  (i.e.,  reading  speed)  has  been  used  as  a 
criterion  of  readability  by  many  other  investigators  also.  Baird,^^ 
Blackhurst,^®  Buckingham,^^  Gilliland,^^  Greene,^®  Grifhng  and 
Franz,^^  and  Hovde,^’’  report  studies  in  which  they  utilized  it  to 
appraise  the  effects  on  readability  of  a  number  of  typographical 
conditions. 

In  general,  investigators  seem  to  agree  that  work  output  as  a 
criterion  of  readability  is  not  as  sensitive  as  could  be  desired. 
Luckiesh  and  Moss,  the  foremost  critics  of  the  method,  have 
used  it  in  conjunction  with  their  other  procedures,  and  report 


M.  A.  Tinker  and  D.  G.  Paterson,  “Studies  of  Typographical  Factors  In¬ 
fluencing  Speed  of  Reading:  II,  Size  of  Type,”  Journal  of  Applied  Psychology, 
13:120-130,  1929. 

W.  Baird,  “The  Legibility  of  a  Telephone  Directory,”  Journal  of  Ap¬ 
plied  Psychology,  1:30-37,  1917. 

J.  H.  Blackhurst,  Investigations  in  the  Hygiene  of  Reading.  Warwick  and 
York,  Baltimore,  1927. 

^®B.  R.  Buckingham,  “New  Data  on  the  Typography  of  Textbooks,”  The 
Textbook  in  American  Education,  Thirtieth  Yearbook  of  the  National  Society 
for  the  Study  of  Education,  Part  II,  pp.  93-125,  1931. 

^■^A.  R.  Gilliland,  “The  Effect  on  Reading  of  Changes  in  Size  of  Type,” 
Elementary  School  Journal,  24:138-146,  1923. 

^®E.  B.  Greene,  “The  Relative  Legibility  of  Linotyped  and  Typewritten 
Material,”  Journal  of  Applied  Psychology,  18:697-704,  1934. 

H.  Grifling  and  S.  I.  Franz,  “On  the  Conditions  of  Fatigue  in  Reading,” 
Psychological  Review,  3:513-530,  1896. 

H.  F.  Hovde,  “The  Relative  Effects  of  Size  of  Type,  Leading  and  Context,” 
Journal  of  Applied  Psychology,  13:600-629  and  14:63-73,  1929  and  1930. 
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that  they  are  firmly  convinced  that  it  is  a  poor  indicator  of  ease 
of  seeing.  At  one  point  for  example,  they  make  the  statement 
that: 

Speed  of  performing  work  is  a  relatively  insensitive  and  sometimes 
misleading  criterion  of  ease  or  difficulty  of  seeing.  .  .  .  Even 
though  the  ocular  muscles  become  fatigued  from  long  periods  of 
reading,  it  has  been  shown  that  they  continue  to  function  effectively 
at  the  expense  of  greater  effort  on  the  part  of  the  reader.  Thus  meas¬ 
urements  of  the  ability  of  the  visual  mechanism  to  perform  a  visual 
task  or  test  usually  do  not  reveal  significant  changes  in  the  quality  of 
their  performances  even  after  prolonged  exertion.  Apparently  the 
brain  is  a  hard  taskmaster  for  the  organs  of  vision,  since  clear  binocu¬ 
lar  vision  is  demanded  and  obtained  under  extreme  conditions.^^ 

As  their  chief  objection  to  reading  speed  as  a  criterion  is  its 
insensitivity,  these  investigators  do  concede  that  “if  a  statistically 
significant  difference  in  the  rate  of  reading  different  materials  is 
found,  it  may  be  safely  assumed  that  the  materials  are  different 
in  readability.”  It  is  their  contention  that,  since  extreme  dif¬ 
ferences  in  readability  would  be  necessary  to  affect  speed  of 
reading,  some  more  sensitive  criterion  is  necessary  for  finer 
discriminations. 

Studies  of  cost  to  the  reader 

The  third  method  of  study,  that  of  measuring  ease  of  seeing 
or  “cost  to  the  reader,”  has  been  used  almost  exclusively  by 
Luckiesh  and  Moss.  There  is  a  good  reason  for  this.  As  was 
pointed  out  at  the  beginning  of  this  chapter,  there  appears  to  be 
no  criterion  of  visual  effort  or  fatigue  which  authorities  agree 
upon  as  being  both  valid  and  reliable.  Among  those  criteria 
which  have  been  suggested  and  applied  are:  changes  in  ampli¬ 
tude  of  accommodation,^^  ergographic  records  of  fatigue  of  con- 

M.  Luckiesh  and  F.  Moss,  “Visibility  and  Readability  of  Print  on  White 
and  Tinted  Papers,”  Sight-Saving  Review,  8:123-134,  1938. 

M.  Luckiesh  and  F.  Moss,  “The  Readability  of  Stencil-Duplicated  Mate¬ 
rials,”  Sight-Saving  Review,  9:295-304,  1939. 

J.  I.  Kurtz,  “An  Experimental  Study  of  Ocular  Fatigue,”  American  Journal 
of  Optometry,  15:86-117,  1938  and  R.  E.  Wager,  “A  Method  of  Measuring 
Fatigue  of  the  Eyes,”  Journal  of  Educatiojial  Psychology,  13:561-570,  1922. 
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vergence,^^  changes  in  size  of  the  pupil, metabolism  and  pulse 
rate  changes,^®  and  rate  of  involuntary  eye-blink.^^  With  the  ex¬ 
ception  of  eye-blink,  none  of  these  has  been  validated  by  re¬ 
peated  use  in  experiments  to  any  appreciable  extent.  Indeed, 
the  investigations  of  metabolism  and  pulse  rate,  and  of  amplitude 
of  accommodation  seem  to  indicate  that  these  criteria  are  not 
satisfactory. 

On  the  other  hand,  the  rate  of  involuntary  eye-blink,  which 
Luckiesh  and  Moss  strongly  endorse  as  a  criterion,  is  reported  by 
them  to  be  quite  sensitive  to  those  changes  which  axiomatically 
would  demand  increased  visual  effort.^®  Blink  rate,  as  a  criterion 
of  visual  effort  being  expended,  or  of  visual  fatigue,  has  been 
used  extensively  by  these  two  investigators,  although  no  others 
appear  to  have  used  it  to  any  extent.  Working  from  the  hy¬ 
pothesis  of  Ponder  and  Kennedy  that  blink  rate  is  closely  related 
to  mental  tension,  and  that  the  eyelid  movements  “constitute  a 
kind  of  relief  mechanism  whereby  nervous  energy  otherwise 
unutilized,  passes  into  a  highly  facilitated  path,”  they  developed 
the  theory  that  the  rate  of  blinking  is  indicative  of  visual  effort 
and  fatigue.  They  selected  many  visual  tasks  which  they  con¬ 
sidered  axiomatically  fatiguing,  and  counted  the  number  of 
blinks  made  by  subjects  performing  these  and  correlative  tasks 
less  fatiguing  in  character.  They  found  blink  rate  to  increase 
without  jail  with: 


M.  Luckiesh  and  F.  Moss,  “Fatigue  of  Convergence  Induced  by  Reading 
as  a  Function  of  Illumination  Intensity,”  American  Journal  of  Ophthalmology, 
18:319-323,  1935. 

M.  Luckiesh  and  F.  Moss,  “Size  of  Pupil  as  a  Possible  Index  for  Visual 
Fatigue,”  American  Journal  of  Ophthalmology,  16:393-396,  1933. 

2®R.  McFarland,  C.  Knehr,  and  C.  Berens,  “Metabolism  and  Pulse-Rate  as 
Related  to  Reading  under  High  and  Low  Levels  of  Illumination,”  Journal  of 
Experimental  Psychology,  25:65-75,  1939. 

M.  Luckiesh  and  F.  Moss,  “The  Eyelid  Reflex  as  a  Criterion  of  Ocular 
Fatigue,”  Journal  of  Experimental  Psychology,  20:589-596,  1937. 

M.  Luckiesh  and  F.  Moss,  “Criteria  of  Readability,”  Journal  of  Experi¬ 
mental  Psychology,  27:256-270,  1940. 

^®E.  Ponder  and  W.  P.  Kennedy,  “On  the  Act  of  Blinking,”  Quarterly 
Journal  of  Experimental  Physiology,  18:89-110,  1927-1928. 
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1.  An  increase  in  the  duration  of  the  reading  period. 

2.  The  introduction  of  an  improper  visual  correction  while 
reading. 

3.  The  addition  of  a  glare-source  within  the  visual  field. 

4.  A  marked  decrease  in  level  of  illumination  on  reading  mat¬ 
ter. 

5.  A  decrease  to  below  normal  (assuming  “normal”  to  be 
about  10-point  to  12 -point  type)  in  size  of  type  read. 

6.  A  decrease  from  6  points  in  leading  of  type  read. 

7.  A  marked  decrease  in  boldness  of  type  read. 

8.  The  use  of  background  papers  of  saturated  colors. 

In  a  summary  of  the  findings  of  these  investigations,  Luckiesh 
and  Moss  state  that  “the  criterion  of  blinking,  when  used  under 
proper  experimental  conditions,  appears  to  be  an  unusually  sensi¬ 
tive  one.”  Elsewhere,  they  state  that: 

It  is  becoming  increasingly  certain  from  numerous  investigations 
which  we  have  made  during  recent  years  that  the  relative  ease  with 
which  different  materials  are  read  may  be  revealed  by  observing  the 
rate  of  involuntary  blinking  while  reading  under  controlled  condi¬ 
tions.  ...  In  addition,  there  is  a  wealth  of  evidence  from  the 
sciences  of  physiology  and  psychology  which  reveals  the  significance 
of  this  criterion  from  a  fundamental  viewpoint  of  ocular  strain  and 
fatigue.  In  brief,  this  criterion  of  ease  of  seeing  possesses  the  ad¬ 
vantages  of  fundamental  significance,  simplicity  in  application,  ob¬ 
jectiveness,  and  satisfactory  sensitivity.  The  consistently  successful 
results  obtained  in  applying  this  criterion  confirm  our  general  theory 
that  ease  of  seeing  is  more  readily  and  certainly  revealed  by  involun¬ 
tary  reactions  than  it  is  by  criteria  of  performance  over  which  the 
subject  exercises  voluntary  control.  Speed  of  reading  may  be  cited 
as  an  example  of  the  latter.^^ 

It  is  to  be  noted  that  these  investigators  quite  consistently 
speak  of  blink  rate  as  a  criterion  of  “ease  of  seeing,”  and  only 
occasionally  as  a  criterion  of  ocular  fatigue.  From  their  investi¬ 
gations,  it  seems  apparent  that  they  consider  this  criterion  sensi- 

M.  Luckiesh  and  F.  Moss,  “A  Summary  of  Researches  Involving  Blink- 
Rate  as  a  Criterion  of  Ease  of  Seeing,”  Illuminating  Engineering,  35:31.  1940. 

M.  Luckiesh  and  F.  Moss,  “The  Readability  of  Stencil-Duplicated  Mate¬ 
rials,”  Sight-Saving  Review,  9:295-304,  1939. 
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tive  to  increases  in  the  expenditure  of  visual  effort,  even  before 
an  appreciable  amount  of  fatigue  is  induced.  For  example,  the 
statement  is  made  in  one  of  their  reports,  that  “the  rate  of  blink¬ 
ing  while  performing  critical  visual  tasks  is  a  function  of  both 
the  duration  and  severity  of  the  specific  visual  task;  and  ...  an 
augmentation  of  either  of  these  factors  increases  the  rate  of 
blinking.”  If  this  is  true,  one  need  not  demand  that  the  sub¬ 
ject  perform  a  task  to  the  exhaustion  point,  nor  even  long 
enough  for  fatigue  to  become  evident,  in  order  to  measure  the 
relative  ease  with  which  a  given  size  of  type  is  read.  If  a  change 
in  type  size  results  in  an  augmentation  of  the  severity  of  the 
reading  task,  it  should  be  reflected  directly  in  an  increase  in  eye- 
blink  rate. 


CRITERIA  TO  BE  USED  IN  THIS  STUDY 

As  Luckiesh  and  Moss  have  used  this  criterion  of  eye-blink 
rate  in  many  investigations  of  seeing  ease,  described  in  scores  of 
articles,  and  report  that:  “Rate  of  blinking  invariably  increased 
as  the  task  was  made  more  difficult  in  various  ways,”  and  as 
there  is  no  reported  evidence  disputing  its  validity,  it  appeared 
to  be  an  ideal  criterion  for  this  investigation. 

Nevertheless,  as  other  investigators  report  positive  results  from 
the  use  of  speed  and  accuracy  of  comprehension  in  investigations 
of  readability,  it  was  decided  to  use  both  reading  speed  and  eye- 
blink  rate  as  criteria  of  readability  in  this  study. 

Furthermore,  since  Luckiesh  and  Moss  claim  that  visibility,  as 
measured  by  their  Visibility  Meter,  correlates  well  with  reada¬ 
bility,  it  was  decided  to  measure  with  that  instrument  the  visi¬ 
bility  of  the  materials  used  in  this  study.  The  relative  visibility 
of  the  types  studied  could  then  be  compared  with  their  reada¬ 
bility,  as  revealed  by  reading  speed  and  eye-blink  rate. 

M.  Luckiesh  and  F.  Moss,  “The  Eyelid  Reflex  as  a  Criterion  of  Ocular 
Fatigue,”  Journal  of  Experimental  Psychology,  20:589-596,  1937. 

M.  Luckiesh  and  F.  Moss,  “A  Summary  of  Researches  Involving  Blink- 
Rate  as  a  Criterion  of  Ease  of  Seeing,”  Illuminating  Engineering,  35:31,  1940. 
Italics  are  author’s. 
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SUMMARY 

Efforts  to  determine  a  criterion  of  the  readability  of  printed 
matter  led  to  the  discovery  that  no  criterion  has  been  established 
universally  as  being  valid  for  this  purpose.  The  search  revealed 
that  three  approaches  have  been  made  to  the  problem  of  deter¬ 
mining  the  characteristics  of  printed  matter  which  make  for  the 
optimum  in  readability:  (1)  studies  of  visibility,  measured  in 
terms  of  visual  thresholds;  (2)  studies  of  readability  measured 
by  speed  of  work  output;  and  ( 3 )  studies  of  readability  measured 
in  terms  of  cost  to  the  reader.  Numerous  investigators  report 
studies  (for  persons  of  normal  vision)  of  the  first  two  types, 
using  a  number  of  devices  and  methods.  In  the  field  of  reading, 
studies  of  work  output  have  generally  been  measurements  of 
reading  speed. 

Studies  of  readability  in  terms  of  cost  to  the  reader  have  been 
confined  largely  to  the  extensive  investigations  of  Luckiesh  and 
Moss,  in  which  rate  of  involuntary  eyelid  reflex  has  been  the  cri¬ 
terion.  Other  criteria  have  been  investigated,  some  of  which 
have  been  demonstrated  to  be  definitely  undesirable;  others  have 
not  been  validated  to  any  appreciable  extent.  Since  Luckiesh 
and  Moss  claim  that  eye -blink  rate  invariably  increases  with  an 
increase  in  the  severity  of  the  visual  task,  this  appeared  to  be  an 
ideal  criterion  for  the  purpose  of  the  present  study.  Reading 
speed  has  also  given  positive  results  in  a  number  of  studies  by 
different  experimenters;  consequently  it  also  was  included  as  a 
criterion.  In  addition,  it  was  decided  to  measure  visibility  with 
the  Luckiesh-Moss  Visibility  Meter,  to  make  possible  a  compari¬ 
son  of  this  factor  with  the  readability  revealed  by  the  other  cri¬ 
teria. 


Ill 


CONDITIONS  OF  THE  EXPERIMENT 

LENGTH  OF  THE  SAMPLING  PERIODS 

Even  had  it  been  possible  to  use  a  criterion  of  readability  which 
would  be  universally  agreed  upon  as  being  valid  and  reliable,  the 
question  of  the  length  of  testing  time  which  is  necessary  to  assure 
a  practical  optimum  of  reliability  for  the  criterion  scores  would 
still  remain.  In  this  respect  there  were  limiting  factors.  Here,  too, 
those  responsible  for  administering  the  education  of  the  partially 
sighted  children  being  studied  were  quite  understandably  unwill¬ 
ing  to  allow  the  children  to  read  for  more  than  very  brief  periods, 
particularly  in  reading  the  types  smaller  than  18  points  in  size. 
Furthermore,  it  will  be  remembered  that  six  type  variations  were 
investigated.  The  pattern  of  the  experiment  (see  page  42)  re¬ 
quired  as  many  test  forms  as  there  were  type  variations;  namely, 
six.  However,  it  is  difficult  to  find  six  reading  passages  of  equiva¬ 
lent  difficulty,  or  suitable  tests  with  six  equivalent  forms,  of  suf¬ 
ficient  length  to  yield  more  than  a  few  minutes’  sampling  of  the 
criteria. 

Other  investigators,  however,  have  obtained  positive  results 
with  sampling  periods  of  only  a  few  minutes.  Luckiesh  and 
Moss,  in  particular,  state  that:  “In  general,  our  experiences  indi¬ 
cate  that  a  period  of  five  minutes  is  adequate,  although  in  some 
cases  it  is  advisable  to  repeat  the  series  of  measurements  and  use 
the  mean  value  for  interpretation.”  ^  As  was  observed  on  page 
1 6,  this  would  seem  to  be  particularly  true  in  the  case  of  the  eye- 


^  M.  Luckiesh  and  F.  Moss,  “Frequency  of  Blinking  as  a  Clinical  Criterion  of 
Ease  of  Seeing,”  American  Journal  of  Ophthalmology,  22:616-621,  1939. 
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blink  criterion,  which — according  to  Luckiesh  and  Moss — di¬ 
rectly  reflects  the  relative  severity  of  the  visual  task. 

Tinker  and  Paterson  used  even  shorter  periods  in  studying 
readability,^  using  a  time  span  of  only  1 54  minutes.  It  will  be  re¬ 
called  that  their  studies  used  reading  speed  as  an  indicator  of 
readability. 

On  these  bases,  therefore,  it  was  decided  that  each  subject 
should  read  each  type  variation  for  a  period  of  five  minutes. 

SELECTION  OF  READING  TESTS  TO  BE  USED 

Criteria  to  be  met 

In  selecting  reading  tests  to  be  used  in  a  study  of  this  kind, 
there  are  several  criteria  to  be  met.  The  first  is  of  obvious  im¬ 
portance:  the  technique  of  the  test  must  be  such  as  to  insure  that 
the  material  is  read.  This  necessitates  some  kind  of  objective 
check  on  comprehension.  Second,  it  is  desirable  to  have  no  com¬ 
prehension  difficulties  of  any  consequence  in  the  reading  matter 
of  the  test  materials.  This  is  necessary  to  insure  a  rate  score  with 
accuracy  as  a  negligible  factor,  so  as  to  allow  the  influence  of 
type  size  to  exert  its  maximum  effect.  Third,  the  reliability  of 
the  test  forms  used  must  be  sufficiently  high  to  assure  the  mean¬ 
ingfulness  of  the  group  comparisons.  In  this  respect,  Lindquist 
states  that:  “A  reliability  as  low  as  .40  may  be  adequate  for  com¬ 
parisons  of  average  scores  for  large  groups  of  individuals.”  ^ 

A  fourth  criterion,  peculiar  to  this  experiment,  is  that  the  test 
must  provide  reading  material  sufficient  to  fill  a  five-minute 
period,  which  had  been  decided  upon  as  the  time  span  of  each 
sample. 

For  the  conditions  of  this  experiment,  it  was  also  necessary 
that  the  test  have  six  fairly  equivalent  forms,  one  for  each  size  of 
type  to  be  tested. 

^  M.  A.  Tinker  and  D.  G.  Paterson,  “Studies  of  Typographical  Factors  In¬ 
fluencing  Speed  of  Reading,  II:  Size  of  Type,”  Journal  of  Applied  Psychology, 
13:120-130,  1929. 

®  E.  F.  Lindquist,  A  First  Course  in  Statistics,  pp.  201-202.  Houghton  Mifflin 
Company,  Boston,  1938. 
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The  tests  selected 

After  an  examination  and  consideration  of  a  number  of  kinds 
of  tests,  it  was  decided  that  the  speed  of  reading  tests  developed 
by  Gates  ^  were  the  most  practical  for  this  study.  These  tests  are 
made  up  of  short  paragraphs  of  from  25  to  30  words.  Following 
each  item  are  four  words,  one  of  which  is  the  correct  answer  to 
the  question  asked  or  implied  in  the  paragraph.  The  person  being 
tested  is  to  underline  the  correct  answer.  An  example  of  these 
items  is  the  paragraph  below,  selected  at  random  from  Form  A 
of  the  tests  as  they  were  printed  for  this  experiment: 

47.  The  Indians  brought  the  Pilgrims  presents  of  baskets,  corn 

and  furs.  They  helped  the  Pilgrims  and  taught  them  many 

things  they  needed  to  know.  What  kind  of  Indians  were  they.^ 
sleepy  friendly  bad  warlike 

It  is  evident  that  the  comprehension  of  each  item  in  the  test  is 
checked  as  it  is  read,  satisfying  the  first  criterion. 

The  difficulty  level  of  the  paragraphs  in  these  tests  is  approxi¬ 
mately  third  grade.  As  steps  were  taken  to  insure  that  each  sub¬ 
ject  for  the  experiment  had  a  reading  ability  above  third  grade 
level,  the  second  criterion  of  the  negligibility  of  comprehension 
difficulties  is  met. 

Reliability  of  the  tests 

It  was  earlier  implied  that  the  reliability  coefficient  of  the  test 
forms  used  must  be  at  least  .40,  preferably  better.  Gates  reports 
the  correlations  between  the  two  forms  of  the  Reading  Survey 
to  be  from  .87  to  .90.®  These  correlations  were  obtained  by  com¬ 
puting  the  correlations  of  the  two  forms  with  each  other.  For 
the  Modern  School  Achievement  Tests,  the  coefficients  of  re- 

^A.  I.  Gates,  Modern  School  Achievement  Tests,  Short  Form  (Speed  of 
Reading)  Forms  I  and  II.  Bureau  of  Publications,  Teachers  College,  Columbia 
University,  New  York,  1931  and  Gates  Reading  Survey  (Speed  Test),  Forms  I 
and  II.  Bureau  of  Publications,  Teachers  College,  Columbia  University,  New 
York,  1939. 

®A.  I.  Gates,  Manual  of  Directions  for  Gates  Reading  Survey,  pp.  13-14. 
Bureau  of  Publications,  Teachers  College,  Columbia  University,  New  York. 
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liability  are  on  the  order  of  .87.®  It  can  be  seen  that  in  this  respect 
the  tests  are  more  than  satisfactory. 

Adaptation  of  the  tests  to  this  study 

With  respect  to  the  criterion  of  number  of  forms  (it  will  be 
remembered  that  six  were  necessary)  and  test  length,  these  tests 
seemed  to  fail  to  meet  the  requirements.  To  begin  with,  there 
are  but  four  forms  in  print,  comprising  altogether  228  test  items. 
As  these  items  are  all  closely  equivalent  in  reading  difficulty,  it 
was  thought  possible  to  redistribute  them  randomly  in  such  a 
manner  as  to  make  six  test  forms  of  38  items  each.  Dr.  Gates 
agreed  that  this  was  a  defensible  procedure. 

It  was  not  at  all  certain,  however,  that  the  better  readers  might 
not  finish  this  number  of  items  before  the  termination  of  the  five- 
minute  sampling  period.  Consequently,  the  investigator  wrote 
seventy-two  additional  items,  identical  in  nature  with  those  in 
the  published  tests,  and  closely  equivalent  in  difficulty.  These 
items  were  distributed  randomly  throughout  the  six  test  forms, 
which  then  consisted  of  50  items  each.  The  investigator  wished 
to  verify  his  own  judgment  as  to  the  similarity  of  these  “home¬ 
made”  items  to  those  of  Gates’s  tests.  Consequently,  he  asked 
several  persons  practiced  in  the  use,  analysis,  and  construction 
of  reading  tests  to  examine  the  tests  in  which  these  items  were 
scattered,  and  to  try  to  distinguish  the  items  written  by  the 
investigator  from  those  of  the  original  tests.  In  no  case  were  the 
newly  written  items  distinguishable  from  the  others,  with  respect 
to  authorship  or  difficulty. 

Because  of  the  fact  that  the  new  items  were  so  similar  in 
nature  and  reading  difficulty  to  the  original  test  items,  it  is 
extremely  unlikely  that  they  should  materially  affect  the  relia¬ 
bility  coefficients  of  those  tests.  Particularly  is  it  unlikely  that 
the  coefficients  would  be  affected  so  greatly  as  to  render  the 

tests  useless  for  purposes  of  group  comparison. 

^Manual  of  Directions  for  Modern  School  Achievement  Tests,  Short  Form, 
p.  1.  Bureau  of  Publications,  Teachers  College,  Columbia  University,  New 
York. 
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The  tests  finally  decided  upon,  then,  were  six  forms  of  50  items 
each,  the  items  randomly  assigned  to  the  forms  from  the  Gates 
Reading  Survey,  The  Modern  School  Achievement  Test  of 
Reading  Speed,  and  items  written  by  the  investigator. 

TYPOGRAPHICAL  CONDITIONS  OF  THE  TESTS 

Conditions  of  the  printed  tests 

In  deciding  upon  the  typographical  conditions  (other  than 
size  and  style  of  type)  of  the  tests,  an  effort  was  made  to  make 
them  consistent  with  the  most  enlightened  practice.  Even  for 
persons  with  normal  vision,  however,  the  optimum  for  length 
of  line,  amount  of  leading,  spacing,  etc.,  is  not  agreed  upon  by 
investigators.  Thus  it  is  evident  that  the  specifications  for  these 
conditions  could  not  be  supported  by  scientific  evidence.  This 
is  explained  in  part  by  the  complex  of  interrelationships  among 
these  factors,  and  the  influence  of  habit  on  the  part  of  readers. 
For  example,  the  optimum  for  leading  (space  between  lines)  and 
line  length  varies  with  the  size  and  style  of  type.  This  may  be 
affected,  also,  by  the  fact  that  the  reader  is  used  to  reading  a 
given  type  size  set  in  lines  of  a  given  length,  and  with  a  given 
leading.^ 

The  only  alternative  was  to  rely  upon  expert  advice  from 
authoritative  sources.  Consequently,  a  number  of  authorities 
versed  in  the  most  approved  practices  in  typography  were  con¬ 
sulted.  Among  them  were  an  executive  of  a  type  manufacturing 
company,  an  executive  in  the  manufacturing  department  of  a 
prominent  publishing  house,  a  book  designer  for  a  well-known 
book  manufacturer,  and  an  instructor  for  an  established  printing 
school.®  It  is  believed,  therefore,  that  the  conditions  of  leading, 

spacing,  line  length,  margins,  placement  of  items,  and  general 

^  A.  I.  Gates,  “What  Do  We  Know  about  Optimum  Lengths  of  Lines  in 
Reading?”  Journal  of  Educational  Research,  23:1-7,  1931. 

®  Those  who  were  particularly  helpful  in  this  respect  were:  Mr.  Harry  L. 
Gage,  Vice-President,  Mergenthaler  Linotype  Company;  Mr.  Ernst  Reichl, 
designer,  H.  Wolff  Book  Manufacturing  Company;  Mr.  John  Benbow,  Head 
of  the  Manufacturing  Department,  Longmans,  Green  and  Company;  and  Mr. 
Thomas  J.  O’Connell,  Instructor,  New  York  School  of  Printing. 
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allover  arrangements  of  the  tests  are  in  accordance  with  best 
printing  practice. 

The  general  conditions  with  respect  to  these  factors  for  each 
printed  type  size  were  as  follows: 

12 -point  Caslon  Bold  No.  3,  Linotype 

1.  24-pica  line  length  (111.5  mm.,  or  4  in.). 

2.  Approximately  45  characters  to  a  24-pica  line,  right-hand 
margin  unjustified. 

3.  Identifying  numbers  flush  with  left-hand  margin. 

4.  Cast  on  16-point  body  (equivalent  of  4  points’  leading). 

5.  In  each  item,  one  em  between  the  period  after  the  identi¬ 
fying  number  and  the  first  character  of  the  item. 

6.  One  blank  slug  (1.5  picas)  between  the  body  of  the  item 
and  the  four  response  words. 

7.  Two  blank  slugs  between  the  response  words  of  one  item 
and  the  beginning  of  the  next  item. 

8.  Two  ems  between  the  response  words  of  an  item,  and  the 
group  of  four  words  centered  on  the  page. 

9.  Five  items  to  a  page  (13  blank  slugs  and  20  to  25  lines  of 
print) . 

14-point  Caslon  Bold  No.  3,  Linotype 

1.  30-pica  line  length  (127  mm.,  or  5  in.). 

2.  Approximately  50  characters  to  a  30-pica  line,  right-hand 
margin  unjustified. 

3.  Identifying  numbers  flush  with  left-hand  margin. 

4.  Cast  on  18-point  body  (equivalent  to  four  points  of  lead- 
ing). 

5.  One  blank  slug  (1.5  picas)  between  the  body  of  the  item 
and  the  four  response  words. 

6.  Two  blank  slugs  between  the  response  words  of  one  item 
and  the  beginning  of  the  next  item. 

7.  Two  ems  between  the  response  words  of  an  item,  and  the 
group  of  four  words  centered  on  the  page. 
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8.  Four  items  to  a  page  (10  blank  slugs  and  16  to  20  lines  of 

18-point  Caslon  Bold  No.  79,  Monotype 

1.  36-pica  line  length  (152.5  mm.,  or  6  in.). 

2.  Approximately  45  characters  to  a  36-pica  line,  right-hand 
margin  unjustified. 

3.  Identifying  numbers  flush  with  the  left-hand  margin. 

4.  Six  points  of  leading  (equivalent  to  being  cast  on  24-point 

5.  Two  picas  between  the  body  of  the  item  and  the  four  re¬ 
sponse  words. 

6.  Four  picas  between  the  response  words  of  one  item  and  the 
first  line  of  the  next  item. 

7.  Two  ems  between  the  response  words  of  an  item,  and  the 
group  of  four  words  centered  on  the  page. 

8.  Three  items  to  a  page. 

24-point  Caslon  Bold  No.  79,  Monotype 

1.  40-pica  line  length  (169  mm.,  or  6%  in.). 

2.  Approximately  40  characters  to  a  40-pica  line,  right-hand 
margin  unjustified. 

3.  Identifying  numbers  flush  with  the  left-hand  margin. 

4.  Six  points  of  leading  (equivalent  to  being  cast  on  30-point 
body). 

5.  5,  6,  7,  and  8  same  as  for  the  18-point. 

Conditions  of  the  typed  and  Mimeographed  tests 

The  style  and  size  of  type  for  the  typewritten  tests  have  been 
described  in  Chapter  I.  All  the  typing  was  done  by  the  same 
person,  an  experienced  and  expert  typist.  Standards  for  the  work 
were  very  high,  no  errors  or  erasures  being  allowed.  As  soon  as 
the  typing  showed  signs  of  becoming  appreciably  lighter,  the 
ribbon  was  changed.  The  ribbon  used  throughout  was  Vogue 
Black,  No.  52  Bulletin  Typewriter  Ribbon,  sold  by  the  Royal 
Typewriter  Company.  Even  with  these  precautions,  however, 
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the  typing  on  some  pages  was  heavier  than  that  on  others,  as  is 
revealed  in  the  visibility  readings  in  Table  1. 

The  ratings  in  Table  1  are  relative  values.  They  were  arrived 
at  by  arbitrarily  assigning  to  the  most  “visible”  type  (24-point 
Caslon)  a  rating  of  100,  and  expressing  the  Visibility  Meter 
readings  for  the  other  types  as  percentages  of  the  visibility  read¬ 
ing  for  24-point  Caslon. 


Table  1 

RELATIVE  VISIBILITY  RATINGS  OF  THE  EXPERIMENTAL  TYPES 
(IN  TERMS  OF  24-POINT  CASLON,  ARBITRARILY  ASSIGNED  A 
RELATIVE  VISIBILITY  VALUE  OF  100)* 


Type 


Rating 


12-pomt  Caslon  .  47 

14-point  Caslon  .  58 

18-point  Caslon  . ; . .  85 

24-point  Caslon  .  100 

Ampli-Type  (typed)  light  .  81 

Ampli-Type  (typed)  dark  .  91 

Ampli-T)^e  (Mimeographed)  .  71 


*  Obtained  by  using  the  Luckiesh-Moss  Visibility  Meter,  averaging  the  re¬ 
sults  of  the  readings  of  adult  observers  of  normal  and  near-normal  vision.  The 
readings  were  obtained  in  the  General  Electric  Research  Laboratories  under 
the  direction  of  Dr.  Matthew  Luckiesh  and  Mr.  Frank  K.  Moss. 


The  machine-duplicated  copies  were  run  from  Mimeograph 
stencils,  on  a  Mimeograph  machine  duplicator,  by  employees  of 
the  A.  B.  Dick  Company,  manufacturers  of  Mimeograph  prod¬ 
ucts.  The  stencils  were  cut  by  the  typist  who  prepared  the  type¬ 
written  materials,  and  the  same  typewriter  was  used.  Because 
of  the  large  size  of  the  type,  it  was  found  desirable  to  place  a 
sheet  of  Pliofilm  ^  over  the  face  of  the  stencil  to  insure  a  clear 
and  uniform  impression.  The  stencils  were  cut  very  deliber¬ 
ately,  and  all  capitals  were  struck  twice.  The  finished  copies 
were  carefully  inspected  and  passed  on  by  an  executive  of  the 
company.  The  resulting  product  was  the  best  Mimeography 
^Registered  trademark,  Goodyear  Tire  and  Rubber  Company. 
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that  could  be  produced  by  the  employees  of  a  company  with  a 
tradition  of  good  workmanship  to  uphold.  In  uniformity  and 
neatness,  it  was  much  superior  to  the  typewritten  copies,  al¬ 
though  it  was  somewhat  lighter  than  the  typing. 

The  page  arrangement  of  the  typed  and  Mimeographed  ma¬ 
terials  was  made  to  resemble  as  closely  as  possible  that  of  the 
24-point  type.  The  specifications  for  these  materials  were  as 
follows: 

1.  6-pica  left-hand  margin  (25  mm.,  or  1  in.). 

2.  39-pica  line  length  (165  mm.,  or  6.5  in.). 

3.  39  characters  to  a  39-pica  line,  right-hand  margin  unjusti¬ 
fied. 

4.  Identifying  numbers  flush  with  left-hand  margin. 

5.  0.75  picas  (%  in.)  between  lower  end  of  lower  case  de¬ 
scenders  of  one  line  and  upper  end  of  lower  case  ascenders 
in  line  below. 

6.  2.5  picas  (%6  in.)  between  descenders  of  last  line  in  the 
body  of  the  item,  and  ascenders  of  response  words. 

7.  5  picas  (2%2  in.)  between  descenders  of  response  words  of 
one  item  and  ascenders  of  first  line  of  next  item. 

8.  Response  words  spaced  according  to  length,  and  centered 
on  the  page. 

9.  Three  items  to  a  page. 

All  the  tests  were  printed  on  paper  8.5  inches  by  11.0  inches. 

THE  PAPER  USED  FOR  PRINTING  THE  TESTS 

The  paper  decided  upon  for  use  in  the  tests  was  chosen  for 
several  reasons.  An  effort  was  made  to  get  a  paper  closely  similar 
in  color  to  that  of  the  “Clear  Type”  publications  widely  used 
in  sight-saving  classrooms.  At  the  same  time,  it  was  necessary 
that  the  paper  be  one  which  would  “take”  printing,  typing,  and 
Mimeographing  well.  The  paper  finally  decided  upon  was  War¬ 
ren’s  Olde  Style  Wove,  India,  Substance  60.  It  is  a  light  cream- 
buff  colored  book  paper  of  high  quality,  with  a  dull,  light-dif- 
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fusing  surface.  The  characteristics  of  this  paper  with  respect  to 
glare,  opacity,  surface  smoothness,  and  color  are  specified  in 
Table  2.  The  glare  test  was  run  on  the  Bausch  and  Lomb 
glarimeter,  the  opacity  test  on  the  Bausch  and  Lomb  opaci- 
meter,  and  the  smoothness  test  on  the  Bekk  smoothness  tester. 

Table  2 

CERTAIN  CHARACTERISTICS  OF  THE  PAPER  UPON  WHICH 

TESTS  WERE  PRINTED 


Characteristic 


Rating 


Glare  (per  cent) 

Wire  .  6.7 

Felt .  4.8 

Opacity  (per  cent)  (contrast  ratio) .  92.4 

Surface  Smoothness 

Wire  .  20.0 

Felt .  14.2 

I.  C.  I.  Color  Specification  (Illuminant  C) 

L  (mmu)  . 576.5 

p  .  21.8 


The  values  in  Table  2  are  interpreted  as  follows: 

Glare:  The  glare  values  are  the  ratio,  expressed  in  per  cent, 
of  the  amount  of  light  reflected  from  the  paper  to  that  reflected 
by  a  plate  of  polished  black  glass  when  the  angles  of  incidence 
and  reflection  are  both  75  degrees  (with  respect  to  the  perpen¬ 
dicular)  .  The  values  given  in  Table  2  are  relatively  low. 

Opacity:  The  opacity  is  a  measure  of  the  contrast  between 
areas  of  the  paper  covering  black  and  white  backgrounds.  For 
example,  the  value  of  92.4  means  that  an  area  over  a  black  back¬ 
ground  reflects  92.4  per  cent  as  much  light  as  an  area  over  a 
(standardized)  white  background.  In  general,  this  ratio  is  di¬ 
rectly  related  to  opacity. 

Surface  Smoothness:  Bekk  Smoothness  is  the  method  com¬ 
monly  employed  for  measuring  the  smoothness  of  paper.  It  is  an 
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indirect  method,  based  on  the  fact  that  the  rate  of  leak  of  air 
between  a  flat  surface  and  the  paper  surface  depends  upon  the 
smoothness  of  the  latter.  The  Bekk  instrument  employs  an  opti¬ 
cally  flat  annular  surface,  which  is  placed  against  the  paper  under 
a  standard  load.  Air  under  a  standard  average  pressure  is  then 
permitted  to  leak  from  a  reservoir  connected  with  the  central 
space  defined  by  the  paper  and  the  hole  in  the  annular  ring.  The 
smoothness  is  expressed  in  terms  of  the  time  required  for  the  air 
pressure  to  fall  from  a  definite  value  above  the  standard  pres¬ 
sure  to  a  second  definite  value  below  the  standard  pressure. 

7.  C.  7.  Color  Specification:  “Illuminant  C”  is  a  carefully  stand¬ 
ardized  light  source  which  simulates  the  light  from  a  lightly 
overcast  sky.  “L”  is  the  dominant  wave  length  of  the  color,  ex¬ 
pressed  in  millimicrons  (one  millimicron  =  0.000001  milli¬ 
meters)  ;  “p”  is  the  purity  of  the  color;  “Y”  is  the  visual  efficiency 
of  the  color,  as  seen  under  illuminant  C.  If  spectrum  light  of 
wave  length  “L”  is  mixed  with  the  light  of  illuminant  C  in  a 
proportion  properly  corresponding  to  “p,”  and  of  intensity  cor¬ 
responding  to  “Y,”  the  color  of  the  specimen  as  seen  by  the 
“standard  observer”  will  be  matched.  For  hues  ranging  from 
violet  through  blue,  green,  etc.,  to  red,  the  dominant  wave  length 
varies  from  about  400  to  700  millimicrons.  The  purity  “p”  for 
a  pure  white  or  a  pure  gray  is  zero,  and  100  for  a  “saturated” 
color  (approached,  for  example,  by  a  red  signal  light).  The 
visual  efficiency  is  sometimes  called  visual  brightness,  and  is  a 
quantity  which  measures  the  sensation  of  lightness  and  darkness; 
“Y”  for  pure  black  is  zero,  and  100  for  a  nearly  perfect  white 
(magnesium  oxide  or  magnesium  carbonate). 

SUBJECTS 


Selection  of  the  subjects 

The  design  of  the  experiment  (see  Chapter  IV)  called  for  72 
subjects.  These  subjects  were  chosen  from  the  sight-saving 
classes  of  New  York  City.  According  to  the  Inspector  of  the 
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Classes  for  the  Blind,  the  general  standard  of  admission  to  the 
New  York  City  sight-saving  classes  is  visual  acuity  of  20/50  or 
less  in  the  better  eye,  with  correction.  Children  whose  visual 
acuity  is  less  than  20/200  in  the  better  eye,  wearing  correction, 
are  admitted  to  the  Braille  classes.  Frequently,  however,  pupils 
with  better  than  20/50  vision  are  admitted  to  the  sight-saving 
classes  when  the  examining  ophthalmologist  judges  that  the  chil¬ 
dren  have  an  eye  condition  which  would  be  seriously  aggravated 
by  the  visual  tasks  required  in  the  regular  classrooms.  The 
reason,  of  course,  is  that  defective  visual  acuity  in  distance  seeing 
is  only  one  of  the  evidences  of  visual  defect. 

The  teachers  of  the  classes  in  which  the  experiment  was  to  be 
conducted  were  asked  to  select  pupils  whose  reading  abilities 
were  definitely  above  third  grade  level.  Because  of  the  fact  that 
there  are  no  reading  tests  with  norms  standardized  on  a  com¬ 
parable  population,  no  objective  test  scores  could  be  obtained 
for  these  pupils.  Consequently,  the  selection  of  the  pupils  who 
satisfied  the  reading  level  requirement  had  to  be  left  to  the  judg¬ 
ment  of  the  teachers.  As  the  only  condition  was  that  the  pupils 
be  able  to  read  above  third  grade  level,  however,  it  was  felt  that 
such  judgment  could  be  relied  on. 

When  this  preliminary  selection  had  been  made,  the  visual 
diagnoses  of  the  pupils  chosen  were  carefully  examined.  Those 
pupils  who  verged  on  blindness,  or  who  had  an  eye  defect  which 
might  be  aggravated  by  the  test  conditions,  were  eliminated  from 
consideration.  A  prominent  eye  physician  assisted  with  this 
phase  of  the  selection.  From  the  group  of  pupils  who  met  the 
foregoing  conditions  satisfactorily,  12  groups  of  six  pupils  each 
were  selected.  These  12  groups  were  from  12  classes,  situated  in 
1 1  different  schools  in  the  boroughs  of  Manhattan  and  the  Bronx, 
New  York  City.^° 

Miss  Frances  Moscrip,  Inspector  of  the  Classes  for  the  Blind  in  the  Public 
Schools  of  the  City  of  New  York,  gave  permission  to  test  the  pupils  used  in 
the  experiment,  and  gave  access  to  the  records  of  the  children  tested. 
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Characteristics  of  the  population 

The  children  finally  selected  as  subjects  for  the  experiment 
ranged  from  9  years  of  age  to  14  years  of  age,  inclusive.  Ap¬ 
proximately  80  per  cent  of  them  were  about  equally  distributed 
through  the  ages  of  10,  11,  and  12.  Their  grade  placements 
ranged  from  grade  4  to  grade  8,  with  the  great  majority  in  grades 
5  and  6.  Of  the  total,  29  were  boys,  and  43  were  girls. 

Table  3  gives  the  visual  diagnosis  for  each  of  the  72  subjects. 
Some  of  the  diagnoses  were  made  by  private  physicians;  others 
are  diagnoses  made  at  public  health  eye  clinics.  The  near-point 
of  accommodation  readings  were  made  by  this  examiner,  using 
the  card  of  the  Ophthalmological  Foundation.  Near-point  of 
accommodation  is  the  nearest  point  to  the  eyes  at  which  the  sub¬ 
ject  is  able  to  focus  without  blurring  of  the  smallest  type  he  can 
read.  It  is  measured  in  millimeters,  and  expressed  in  fractional 
form,  with  the  millimeter  reading  as  the  numerator,  and  the  type 
size  rating  as  the  denominator. 

The  high  incidence  of  myopia  and  high  myopia  among  these 
children  is  typical  of  the  population  of  the  sight-saving  classes  of 
New  York  City.  Indeed,  no  other  single  eye  defect  approaches 
myopia  in  its  range  of  occurrence  among  these  pupils. 

It  had  been  planned  to  select  the  pupils  to  form  several  groups 
according  to  eye  defect,  so  that  comparisons  of  the  readability 
of  types  according  to  visual  anomaly  could  be  made.  It  was 
found,  however,  that  such  grouping  was  not  feasible,  and  the 
plan  was  abandoned  in  favor  of  a  group  chosen  without  respect 
to  eye  defect. 

As  was  mentioned  earlier  in  this  chapter,  the  reading  ability 
of  all  72  pupils  was  above  third  grade  level,  according  to  the 
judgment  of  the  teachers  of  the  pupils.  Also  in  the  judgment  of 
the  teachers,  the  best  readers  in  the  group  were  of  high  sixth,  or 
low  seventh,  grade  in  reading  level.  The  great  majority  were  of 
about  good  fifth  grade  level  in  reading;  that  is  to  say,  they  could 
read  with  facility  from  books  ordinarily  used  in  grade  five. 
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TESTING  CONDITIONS  AND  METHODS 

The  pupils  were  tested  in  groups  of  six,  with  no  other  pupils 
present  in  the  room.  The  teacher  of  the  class  being  tested  timed 
the  test  with  a  stop  watch.  To  insure  uniformity  of  lighting,  the 
pupils  were  seated  in  such  a  way  that  the  light  fell  on  the  papers 
from  the  left,  except  in  the  case  of  left-handed  pupils,  who  were 
seated  with  the  light  to  their  right.  Each  desk  surface  was  tested 
with  a  General  Electric  Light  Meter  (which  measures  light  in¬ 
tensity  in  foot-candles),  and  every  desk  surface  was  receiving 
between  twenty-seven  and  thirty-five  foot-candles  of  light  while 
the  tests  were  being  given. 

An  observer  was  seated  to  the  right  (to  the  left  for  left-handed 
pupils)  and  slightly  to  the  rear  of  each  pupil  being  tested.  The 
observers  code-marked  each  test,  and  made  certain  that  the  pupils 
understood  the  directions  for  taking  the  test.  During  the  five- 
minute  sampling  period,  each  observer  counted  the  eye-blinks 
of  his  subject,  and  at  the  conclusion  of  the  five  minutes  entered 
on  the  test  paper  of  the  subject  the  number  of  eye-blinks 
counted. 

Before  the  testing  for  each  group  began,  the  author  gave  the 
directions  for  the  test  procedure  in  these  words: 

We  are  going  to  have  some  easy  tests  of  reading.  One  of  our  visi¬ 
tors  is  going  to  sit  beside  each  of  you  to  make  sure  that  you  do  things 
in  the  right  way.  You  are  not  to  ask  questions  about  the  readings  but 
our  visitors  will  help  you  if  you  do  something  wrong. 

Now  let  us  look  at  the  directions.  (Reading  the  directions  which 
appear  on  the  front  of  each  test.)  “This  is  a  test  to  see  how  well  you 
can  read.  In  it  are  a  number  of  paragraphs  like  the  one  below.  Un¬ 
der  each  paragraph  are  four  words.  You  are  to  read  the  paragraph, 
and  then  draw  a  line  under 'the  one  word  which  is  right.  Remember, 
draw  a  line  under  only  one  word  after  each  paragraph.” 

Now  you  do  the  sample  yourself. 

(Pupils  do  the  sample,  with  assistance  from  the  observer  if  neces¬ 
sary.  When  all  have  finished  it  satisfactorily,  examiner  continues.) 

When  Miss _ says,  “Begin,”  you  are  to  turn  this  front  page 

back  and  start  reading.  Do  all  the  paragraphs  the  way  you  did  this 
sample.  As  you  finish  each  page,  turn  to  the  next,  and  keep  right  on 
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reading  until  Miss _ says,  “Stop.”  Read  as  many  paragraphs 

as  you  can  without  making  mistakes.  If  you  see  that  you  have  made 
a  mistake,  do  not  erase,  but  cross  out  the  line  this  way  (illustrating 
on  blackboard),  and  draw  another  line  under  the  one  word  you 
think  is  correct.  Remember  to  draw  a  line  under  only  one  word. 

At  the  end  of  exactly  five  minutes,  the  teacher  said,  “Stop.” 

The  pupils  took  all  six  test  forms  at  one  sitting.  There  were 
three-minute  rest  periods  between  the  tests,  with  the  exception 
of  the  third  and  fourth  test  forms.  Between  these  two  there  was 
a  five-minute  rest  period,  during  which  the  pupils  were  free  to 
leave  their  seats,  talk  with  each  other,  and  go  to  the  lavatory. 
As  will  be  explained  in  Chapter  IV,  all  six  pupils  in  each  group 
took  the  same  test  form  at  the  same  time,  but  at  any  given  time 
no  two  pupils  were  reading  the  same  type  variation. 

The  tests  were  scored  by  two  persons,  each  of  whom  checked 
his  scoring  once,  and  then  checked  the  scoring  of  the  other. 
Thus,  the  scoring  of  each  test  was  checked  twice. 

» 

Observers,  and  method  of  counting  eye-blink 

The  observers  who  counted  the  eye-blinks  were  six  adult 
graduate  students.  Each  was  known  to  be  a  careful,  thorough 
worker,  well  acquainted  with  experimental  technique,  and  aware 
of  the  importance  of  exactitude  in  experimentation.  Before 
testing  the  pupils  included  in  this  study,  each  observer  had  ample 
practice  in  counting  the  eye-blinks  of  other  persons.  The  author 
gave  careful  instructions  to  the  group  concerning  the  method  to 
be  used. 

Before  beginning  the  experiment,  a  mechanical  counter  was 
tried  in  counting  blinks.  It  was  found  that  the  noise  made  by 
the  counter  distracted  the  subjects,  and  this  device  was  discarded. 
The  observers,  therefore,  simply  counted  the  blinks  mentally. 
This  undoubtedly  introduced  an  element  of  error  into  the  scores. 
Luckiesh  and  Moss  use  this  technique,^^  and  it  has  been  men¬ 
tioned  that  they  report  consistently  positive  results. 

According  to  a  letter  to  the  author  from  Frank  K.  Moss. 
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SUMMARY 

Five  minutes  was  decided  upon  as  ample  time  for  each  of  the 
six  sampling  periods. 

Six  test  forms  of  50  items  each  were  used,  composed  of  items 
from  the  Gates  Modern  School  Achievement  Test  of  Reading 
Speed,  the  speed  test  from  the  Gates  Reading  Survey,  and  72 
items  closely  similar  to  the  items  in  these  tests,  written  by  the 
author  of  this  study.  The  entire  300  items  were  randomly  dis¬ 
tributed  through  the  six  test  forms. 

The  typographical  conditions  of  the  tests  were  determined  by 
advice  from  professional  authorities  in  the  field  of  typography. 
It  is  believed,  therefore,  that  the  typography  of  the  tests  con¬ 
forms  to  the  best  current  typographical  practice. 

The  paper  used  was  Warren’s  Olde  Style  Wove,  India,  Sub¬ 
stance  60.  It  is  a  light  cream-buff  colored  book  paper  of  high 
quality,  with  a  dull,  light-diffusing  surface. 

The  subjects  were  72  pupils  in  sight-conservation  classes  of  the 
New  York  City  public  school  system.  They  were  chosen  with¬ 
out  respect  to  eye  defect;  consequently,  the  group  is  heavily 
weighted  with  myopic  subjects.  All  subjects  were  judged  by 
their  teachers  to  be  above  third  grade  level  in  reading  ability. 
There  were  12  groups  of  six  pupils  each,  from  11  different 
schools  in  the  boroughs  of  Manhattan  and  the  Bronx,  New  York 
City.  Their  ages  ranged  from  9  to  14,  and  they  were  from  grades 
4  through  8.  Of  the  total,  29  were  boys  and  43  were  girls. 

The  pupils  were  tested  in  groups  of  six,  under  fairly  uniform 
conditions.  They  each  took  six  tests,  each  test  in  a  different  type 
variation,  each  lasting  five  minutes.  During  this  period,  an  ob¬ 
server  sat  beside  each  pupil,  and  counted  the  total  number  of 
eye-blinks  during  each  five-minute  period.  All  six  forms  were 
given  at  one  sitting  to  each  group,  with  brief  rest  periods  be¬ 
tween  tests. 


IV 


EXPERIMENTAL  DESIGN  AND 
STATISTICAL  METHOD 

IMPORTANCE  OF  EXPERIMENTAL  DESIGN 

The  relationship  between  experimental  design  and  valid  conclu¬ 
sions  has  been  overlooked  all  too  frequently  in  educational  re¬ 
search.  Yet  that  relationship  is  of  crucial  importance.  The  great 
majority  of  educational  experiments  are  concerned  with  measur¬ 
able  differences  between  groups,  or  differences  between  initial 
and  subsequent  measures  on  the  same  group.  The  experimenter 
is  generally  interested  in  determining  whether  differences  or 
changes  revealed  by  such  measures  have  been  effected  by  some 
given  treatment  or  factor. 

If  we  could  be  certain  that  all  factors  other  than  that  under 
study  were  perfectly  controlled,  we  could  then  be  certain  that 
the  changes  we  observed  could  be  attributed  to  that  factor  alone. 
It  is  well  known,  however,  that  a  perfectly  controlled  experi¬ 
mental  situation  never  exists,  particularly  in  educational  studies, 
in  which  there  are  so  many  human  variables.  Hence,  it  is  always 
possible  that  observed  differences  between  initial  and  final  meas¬ 
ures  may  be  attributable  to  factors  other  than  the  one  whose 
effect  we  are  attempting  to  determine,  or  that  real  differences 
induced  by  the  factor  being  studied  are  obscured  by  the  effects 
of  uncontrolled  or  partially  controlled  variables.  Consequently, 
we  can  never  treat  the  results  of  this  kind  of  research  as  “ab¬ 
solute”  data;  they  must  always  be  looked  upon  as  approximations 
of  the  actual  conditions,  approximations  into  which  the  uncon¬ 
trolled  factors  have  introduced  a  certain  amount  of  error. 


39 


40  THE  READABILITY  OF  TYPE  SIZES  AND  FORMS 


Fortunately,  statistical  methods  have  been  evolved  to  estimate 
the  effects  of  such  uncontrolled  variables.  The  differences  caused 
by  these  factors  are  considered  to  be  “errors  of  measurement,” 
and  the  statistical  estimates  of  their  maximum  effects  are  known 
as  “estimates  of  error.” 

With  the  foregoing  in  mind,  we  can  see  that  there  are  two 
conditions  to  be  satisfied  in  any  educational  experiment.  First, 
steps  must  be  taken  to  provide  adequate  control  of  as  many  vari¬ 
ables  as  possible;  second,  provision  must  be  made  for  arriving  at 
a  dependable  estimate  of  the  effect  of  variables  not  fully  con¬ 
trolled — a  dependable  estimate  of  error. 

If  the  second  condition  is  met,  the  fact  that  some  factors  whose 
effects  we  wish  to  discount  are  but  imperfectly  controlled  need 
give  us  little  concern,  provided  they  have  been  randomized.  Our 
estimate  of  error  will  give  us  an  approximation  of  their  maximum 
influence  upon  the  results.  The  importance  of  such  an  estimate 
is  well  stated  by  Lindquist: 

It  is  extremely  significant  that  unless  one  has  a  fairly  definite  and 
dependable  (even  though  subjectively  derived)  estimate  of  the 
maximum  error  which  might  be  present  in  an  obtained  difference, 
one  can  draw  no  useful  conclusion  from  that  difference,  no  matter 
how  precise  it  may  be  in  reality.  So  long  as  the  error  may  be  of  any 
magnitude,  it  is  always  conceivable  that  the  difference  is  due  to  error 
alone.  Given  a  dependable  estimate  of  error,  one  can  demonstrate 
that  certain  hypotheses  are  inconsistent  with  the  results  obtained; 
without  any  such  estimate  any  hypothesis  whatever  is  admissible, 
including  of  course  the  hypothesis  that  there  are  no  real  differences 
in  treatments.  In  a  very  real  sense,  then,  it  is  more  important  to  know 
the  degree  of  precision  of  an  experiment,  whether  high  or  low,  than 
it  is  that  the  precision  be  in  reality  high.^ 

The  manner  in  which  an  experiment  is  designed  will  in  large 
measure  determine  the  accuracy  of  the  error  estimate.  Some  ex¬ 
perimental  designs  tend  to  exaggerate  the  effects  of  extraneous 
factors,  leading  us  to  believe  that  our  observed  differences  are 
less  significant  than  they  actually  are.  Other  types  operate  to 

^  E.  F.  Lindquist,  Statistical  Analysis  in  Educational  Research,  p.  77.  Hough¬ 
ton  Mifflin  Company,  Boston,  1940. 
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underestimate  the  magnitude  of  error,  leading  to  conclusions  that 
the  observed  differences  are  more  significant  than  they  are  in 
actuality.^ 


DESIGN  OF  THE  EXPERIMENT 


After  considering  experimental  designs  possible  in  the  present 
experiment,  it  was  decided  to  utilize  analysis  of  variance^  de¬ 
veloped  in  recent  years  by  R.  A.  Fisher.  This  method  of  statisti¬ 
cal  analysis  has  as  yet  had  relatively  little  application  in  the  field 
of  educational  research.  The  technique  it  employs  provides  for 
a  dependable  estimate  of  the  error  inherent  in  this  kind  of  experi¬ 
ment. 

The  fundamental  thesis  of  analysis  of  variance  is  that  the 
total  variance  of  a  large  sample  containing  a  number  of  groups 
equal  in  size  can  be  divided  into  two  components:  the  average  of 
the  variances  within  the  groups,  and  the  variance  among  the 
group  means.  These  two  variances  provide  independent  esti¬ 
mates  of  the  population  parameter,  one  being  an  estimate  of  the 
population  variance  based  on  the  average  variance  within  the 
groups  of  the  sample,  the  other  being  an  estimate  of  the  popula¬ 
tion  variance  based  on  the  variance  of  the  means  of  the  sample 
groups.  Assuming  that  the  samples  from  which  these  two  esti¬ 
mates  were  derived  are  random  samples  of  the  same  population, 
these  estimates  of  the  population  variance  should  differ  only 
by  chance.  This  hypothesis  can  be  tested  by  means  of  the  F-test, 
to  determine  whether  the  ratio  between  these  two  parts  of  the 
total  variance  is  larger  than  chance  allows.  If  it  is,  we  have  evi¬ 
dence  that  the  subgroups  are  not  samples  of  the  same  population, 
and  that  a  true  difference  exists  between  them.^ 

One  of  the  conditions  of  this  procedure  is  that  all  factors  which 
might  introduce  systematic  differences  must  be  randomized. 


^  For  a  more  thorough  treatment  of  this  principle,  see  Lindquist,  op.  cit.. 
Chap.  IV. 

®This  is  a  drastic  precis  and  oversimplification  of  the  topic.  For  a  more 
thorough  development  of  the  logic  of  the  proposition,  see  E.  F.  Lindquist, 
Statistical  Analysis  in  Educational  Research,  Chapter  V,  “Analysis  of  Variance.” 
Houghton  Mifflin  Company,  Boston,  1940. 
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The  necessity  for  this  precaution  can  be  readily  understood  from 
the  fact  that  the  F-test  is  a  test  of  the  degree  to  which  the  vari¬ 
ation  among  the  observed  differences  is  greater  than  that  to  be 
expected  by  chance,  and  this  can  be  estimated  only  when  random 
selection  has  been  provided  for. 

Patter?!  of  test  administration 

To  randomize  the  effects  of  the  various  factors,  the  tests  were 
administered  in  a  Latin  Square  pattern  to  each  group  of  six  pupils. 
As  there  were  six  test  forms,  each  form  printed  in  six  type  vari¬ 
ations,  there  were  altogether  thirty -six  combinations  of  test  form 
and  type  variation.  In  the  first  group  of  six  pupils,  each  pupil 
took  six  of  these  combinations,  each  pupil  taking  six  combinations 
different  from  those  taken  by  any  of  the  other  five  pupils.  The 
same  process  was  repeated  with  the  other  five  groups  of  subjects 
in  the  first  group  of  thirty-six  children,  except  that  the  order  of 
test  administration  was  changed  in  each  case.  This  resulted  in 
assuring  that  every  possible  combination  of  type  size,  test  form, 
and  order  of  test  administration  (216  such  combinations  were 
possible)  was  encountered  by  each  of  the  two  groups  of  thirty- 
six  pupils,  without  any  two  pupils  within  a  group  taking  the  same 
six  combinations.  In  Figure  I,  for  example,  it  can  be  observed 
that  Pupil  1  is  the  only  pupil  of  that  group  of  thirty-six  to  take 
Form  C  in  12-point  type  in  third  position  of  test  order.  This 
entire  procedure  was  then  replicated  with  the  second  group  of 
thirty-six  pupils. 


ANALYSIS  OF  THE  DATA 

An  extension  of  the  basic  proposition  of  analysis  of  variance 
allows  us  to  make  more  intricate  analyses.  Instead  of  estimating 
the  contribution  (to  the  total  variance)  of  only  t'wo  of  the 
sources  of  variation,  this  technique  makes  possible  an  analysis  of 
the  variance  into  a  number  of  components  in  such  a  way  as  to 
estimate  the  variance  induced  by  each,  as  well  as  the  amount  of 
variance  attributable  to  undetermined  or  unrandomized  factors 
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(these  latter  lumped  together  as  one  factor).  The  F-test  can 
then  be  applied  to  determine  the  significance  of  the  difference 
among  the  variances  attributable  to  these  factors.  In  the  present 
study  the  data  were  analyzed  to  estimate  the  variance  derived 
from  each  of  five  sources: 

1.  Differences  among  individuals 

2.  Order  of  test  administration 

3.  Test  form 

4.  Type  variation 

5.  Error  (residual  variance) 

r 

In  some  experiments  using  the  technique  of  analysis  of  vari¬ 
ance,  the  error  (or  residual)  variance  is  further  analyzed  for  the 
variance  attributable  to  the  interaction  among  various  factors. 
In  this  study,  for  example,  it  is  possible  (though  improbable) 
that  some  of  the  variance  might  have  been  assignable  to  the  inter¬ 
action  between  type  variation  and  order  of  test  administration. 
That  is,  a  given  type  form  might  show  better  results  in  reada¬ 
bility  when  administered  first,  rather  than  third  or  last.  Even 
were  such  interaction  probable,  however,  it  would  have  been 
impossible,  in  the  pattern  of  this  experiment,  to  analyze  its  ef¬ 
fects.  To  do  so,  it  would  have  been  necessary  for  each  subject 
to  experience  every  possible  combination  of  test  form,  type  vari¬ 
ation,  and  test  order — clearly  a  practical  impossibility.  Because 
of  this,  the  effects  of  such  interaction  (such  as  they  might  be) 
are  confounded  with  the  individual  differences  variance. 

In  making  an  analysis  of  variance,  four  quantities  must  be  de¬ 
termined  for  each  component:  The  sum  of  squares,  the  degrees 
of  freedom,  the  variance,  and  the  F-value.  Given  below  are  the 
formulas  by  which  an  estimate  of  the  variance  of  each  of  the 
major  sources  of  variation  in  this  study  may  be  derived.  The 
numerator  quantity  in  each  case  is  the  sum  of  squares  for  that 
factor,  whereas  the  denominator  is  the  number  of  degrees  of 
freedom.  The  symbolism,  as  it  is  used  herein,  is  explained  be¬ 
forehand: 
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=  variance  (factor  indicated  by  subscript) 

N  =  number  of  scores  (432) 
n  =  number  of  subjects  (72) 
k  =  number  of  groups  (6  in  every  case) 

X  =  any  score 

X^  =  score  of  any  individual 

=  score  of  any  designated  test  order  position,  from  1  to  6 
Xf  =  score  of  any  designated  test  form,  A  to  F 
X^  =  score  of  any  designated  type  variation 


Basic  formulas 

(1)  Total  variance: 

(SX)2 

2X2 - 

N 

S^T  =  - 

df={N-l) 

(2)  Variance  induced  by  differences  among  individuals: 

(2X,)2  (SX)2 

2 - 

k  N 

df={n-\) 

(3)  Variance  induced  by  order  of  test  administration: 

(2Xo)2  (SX)2 

2 - 

n  N 

52  _ _ 

df  =  {h  -  1) 

(4)  Variance  induced  by  test  forms: 

(2X/)2  (2X)2 

2 - 

n  N 

S^f  = - 

df= 

(5)  Variance  induced  by  type  variation: 

(SX,)2  (2X)2 


n  N 

S\  = - 


df  =  {kt  -  1) 
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(6)  Residual  variance: 

Total  sum  _  /  Sum  of  squares  for  all  other \ 
of  squares  \ systematic  forms  of  variance/ 

resid  — • - 

dJ=N-in-l)-{ko-l)-  {kf  -  1)  -  (kt  -  1) 

Analyses  of  the  above  components  were  made  for  each  of  the 
two  criteria:  speed  and  eye-blink.  The  results  of  these  analyses 
are  given  in  Tables  4  and  5.  As  we  are  trying  to  determine 
whether  or  not  the  systematic  variance  for  each  of  the  compo¬ 
nents  is  significantly  greater  than  the  error  variance,  the  F-value 
for  each  is  computed  by  using  the  variance  of  the  component 
concerned  as  the  numerator  of  a  fraction  whose  denominator  is 
always  the  residual  variance.  The  resulting  F-value  can  then  be 
referred  to  a  table  of  such  values  computed  by  Snedecor,^  which 
gives  the  values  exceeded  by  chance  in  5  per  cent  of  the  cases 
and  in  1  per  cent  of  the  cases,  for  various  combinations  of  de¬ 
grees  of  freedom.  It  will  be  observed  from  the  tables  that  the 
F-values  with  respect  to  type  variation  are  not  significant  for 
either  of  the  two  criteria.  In  every  case,  the  variances  arising 
from  differences  in  type  variation  are  smaller  than  the  variance 
attributable  to  any  other  source,  and  are  even  smaller  than  the 
variation  to  be  expected  from  chance  alone.  In  other  words,  the 
statistical  analysis  revealed  no  systematic  differences  among  the 
type  variations  in  the  criterion  scores.  The  means  for  each  type 

Table  4 

ANALYSIS  OF  VARIANCE  FOR  SPEED  SCORES 


Sum  of  Degrees  of 

Source  of  Variation  Squares  Freedom  Variance  F-Values  F.os  F.oi 


Individual  differences  9,203  71  129.62  36.929  1.37  1.56 


Test  form  .  448  5  89.60  25.527  4.37  9.04 

Testing  order  .  116  5  23.20  6.610  4.37  9.04 

Type  variation .  7  5  1.40  .400  4.37  9.04 

Residual  (error)  ....  1,211  345  3.51 

Total  .  10,985  431  25.49 


^  G.  W.  Snedecor,  Statistical  Methods,  Table  10.2,  pp.  174-177.  Reproduced 
in  Lindquist,  op.  cit.,  Table  4,  pp.  62-65. 
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Table  5 

ANALYSIS  OF  VARIANCE  FOR  EYE-BLINK  SCORES 


Sum  of  Degrees  of 

Source  of  Variation  Squares  Freedom  Variance  F -Values  F.os  E.oi 


Individual  differences  54,953  71  773.99  22.598  1.37  1.56 


Test  form  .  175  5  35.00  1.022  4.37  9.04 

Testing  order .  113  5  22.60  .660  4.37  9.04 

Type  variation .  105  5  21.00  .600  4.37  9.04 

Residual  (error) .  11,816  345  34.25 

Total  .  67,162  431  155.83 


variation  are  presented  in  Table  6,  to  illustrate  how  little  differ¬ 
ence  there  was  among  them. 


Table  6 

MEANS  OF  EYE-BLINK  AND  SPEED  SCORES 
FOR  EACH  TYPE  VARIATION 


T 

M 

12 

14 

18 

24 

General 

Mean 

Eye-blink  ... 

. ...  15.50 

15.78 

16.17 

15.71 

16.57 

15.00 

15.79 

Speed  . 

....  15.36 

15.21 

15.19 

15.47 

15.10 

15.28 

15.27 

Although  the  F-value  with  respect  to  type  variation  is  that 
with  which  this  study  is  primarily  concerned,  it  is  revealing  to 
note  the  values  for  the  other  components.  As  is  to  be  expected, 
the  variance  attributable  to  the  differences  among  the  subjects 
was  the  largest  and  most  highly  significant  for  both  criteria.  For 
the  speed  scores  (Table  4,  p.  46)  the  next  most  significant  vari¬ 
ance  was  that  arising  from  differences  in  test  form.  There  are, 
of  course,  no  reading  tests  with  several  forms  which  are  per¬ 
fectly  equated.  The  fact  that  this  lack  of  equivalence  is  re¬ 
vealed  and  accounted  for  in  the  F-test  is  an  illustration  of  the 
value  of  the  factorial  design  employed,  and  of  analysis  of  vari¬ 
ance.  It  will  be  noted  that  the  analysis  also  accounted  for  the 
variation  traceable  to  the  order  in  which  the  tests  were  taken. 
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The  high  degree  of  significance  of  the  variance  arising  from 
the  effects  of  individual  differences  for  both  speed  and  eye-blink 
scores,  and  of  the  variance  attributable  to  test  form  and  testing 
order  in  the  speed  scores,  serves  to  emphasize  the  necessity  for 
controlling  these  sources  of  variation,  or  for  providing  for  a  de¬ 
pendable  estimate  of  their  maximum  effects.  The  value  of  the 
statistical  technique  employed  in  this  investigation  is  evidenced 
by  the  fact  that  it  provided  an  estimate  of  the  effects  of  the  ma¬ 
jor  sources  of  variation,  which  would  probably  have  been  un¬ 
accounted  for  to  a  large  extent  had  another  technique  been  em- 

Analysis  by  visual  anomaly 

Because  it  was  possible  that  some  true  differences  were  hidden 
in  the  data,  a  further  analysis  was  made.  From  the  72  subjects 
were  chosen  40  with  myopia  or  myopic  astigmatism,  and  10  with 
hyperopia  or  hyperopic  astigmatism.  Children  with  these  de¬ 
fects  were  chosen  for  two  reasons:  (1)  because  these  defects 
appeared  with  greater  frequency  than  any  others  among  the 
subjects,  and  (2)  because  it  was  felt  that  if  there  were  any  dif¬ 
ferences  in  readability  induced  by  visual  anomaly,  they  would 
be  most  clearly  exhibited  and  contrasted  in  near-sighted  and  far¬ 
sighted  subjects. 

The  procedure  in  this  analysis  was  that  of  finding  within  each 
group  (and  for  each  of  the  two  criteria)  the  mean  for  each  type 
variation,  and  determining  whether  there  were  significant  dif¬ 
ferences  among  or  between  them.  This  was  done  for  both  speed 
of  reading  and  eye-blink  frequency  for  each  of  the  groups.  The 
method  employed  to  test  the  significance  of  the  differences  be¬ 
tween  means  was  that  of  comparing  each  difference  with  its  own 
standard  error.  The  difference  would  be  considered  significant 
if  it  were  as  much  as  2.576  times  its  own  standard  error  (the  .01 
level) . 

In  making  an  analysis  of  this  kind  with  these  data,  it  must  be 
recognized  that  the  scores  for  the  various  type  forms  tested  are 


EXPERIMENTAL  DESIGN  AND  STATISTICAL  METHOD  49 


correlated.  That  is,  the  subjects  who  read  fastest  while  reading 
one  type  form  are  very  likely  those  who  read  fastest  in  the 
reading  of  the  other  forms  as  well.  Similarly,  those  with  a 
normally  high  blink  rate  will  be  likely  to  blink  more  frequently 
than  the  other  subjects,  no  matter  what  type  forms  they  are 
reading.  This  need  give  little  concern  in  treating  and  interpret¬ 
ing  the  data,  as  the  problem  here  is  each  individuaPs  variation 
from  type  form  to  type  form,  regardless  of  whether  his  speed  or 
blink  rate  be  consistently  high  or  low.  In  other  words,  we  are 
investigating  the  differences  among  the  scores  for  type  form,  not 
the  differences  among  individuals.  Nevertheless,  the  correla¬ 
tions  induced  by  this  factor  must  be  taken  into  consideration  in 
the  statistical  analysis.  In  the  analysis  of  variance,  it  was  ac¬ 
counted  for  in  the  “Individual  Differences”  variance.  In  making 
an  analysis  by  the  method  of  testing  the  significance  of  differ¬ 
ences  between  means,  this  correlation  must  again  be  accounted 
for,  as  the  differences  being  tested  are  differences  between  the 
means  of  related  measures. 

Consequently,  the  sixty  correlations  among  the  scores  were 
computed.  They  ranged  from  .670  to  .931  for  the  eye-blink 
scores,  with  a  median  of  .790,  and  from  .648  to  .958  for  the  speed 
scores,  with  .818  as  the  median.  All  were  positive.  These  r’s 
were  then  employed  in  the  formula: 


~  My  —  “h  ^My  ~  ^rxy^Mj^My 


to  compute  the  standard  error  of  each  difference  between 
means.® 

For  the  myopes,  the  standard  error  of  the  mean  was  arrived 
at  by  using  the  formula:  ® 


M 


^  (sample) 

N 


®E.  F.  Lindquist,  A  First  Course  in  Statistics,  pp.  124-126.  Houghton  Mifflin 
Company,  Boston,  1938. 

^Ibid.,  pp.  105-115. 
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This  formula  was  not  appropriate  for  estimating  the  standard 
error  of  the  mean  for  the  hyperopes,  however,  as  the  sample  was 
so  small  (10  cases).  Consequently,  the  formula: 


<T 


M 


‘5'  (sample) 


N-  1 


was  employed,  as  it  gives  a  more  dependable  estimate  for  small 
samples  (less  than  25  cases). ^ 

The  means  of  the  scores  for  these  two  groups  are  presented 
in  Table  7.  Table  8  gives  the  results  of  the  foregoing  analysis. 
It  will  be  noted  that,  of  the  sixty  differences  between  means 
which  were  tested,  only  three  were  significant  at  the  .01  level. 
For  at  least  two  reasons,  these  probably  may  be  disregarded. 
First,  out  of  the  sixty  differences,  it  is  not  unlikely  that  there 
will  be  a  few  significant  ones  simply  by  chance.  Second,  the 
differences  which  are  significant  do  not  “make  sense.”  For  the 
myopes,  both  significant  differences  are  between  “adjacent” 
sizes  of  type.  Whereas  the  differences  between  14-  and  18-point 
and  18-  and  24-point  type  are  significant,  the  difference  between 
14-  and  24-point  type  is  not.  For  hyperopes,  the  difference  be- 


Table  7 

MEANS  OF  SPEED  AND  EYE-BLINK  SCORES  FOR  MYOPES 
AND  HYPEROPES,  WITH  DIFFERENCES 


Type  Size 


T  M  12  14  18  24 


Speed 

Myopes  .  15.43  15.65  15.78  15.85  15.65  15.50 

Hyperopes  .  13.00  12.20  12.60  13.20  12.00  12.70 

Difference  .  2.43  3.45  3.18  2.65  3.65  2.80 

Eye-Blink 

Myopes  .  15.08  15.68  15.90  13.90  16.55  14.25 

Hyperopes  .  13.00  15.00  16.50  17.00  15.80  13.10 

Difference  .  2.08  .68  -.60  -3.10  .75  1.15 


Ibid.,  p.  113,  footnote. 
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Table  8 

DIFFERENCES  BETWEEN  MEANS,  WITH  STANDARD  ERRORS 
for  SPEED  AND  EYE-BLINK  SCORES  OF  MYOPES  AND 

HYPEROPES 


Difference  between 
Means  of 

Eye-Blink 

Speed 

Myopes 

Hyperopes 

Myopes 

Hyperopes 

Diff. 

(TDiff. 

Diff. 

(TDiff. 

Diff. 

(TDiff. 

Diff. 

(TDiff. 

T  and  M  . 

.60 

1.16 

2.00 

1.74 

.22 

.42 

.80 

.52 

T  and  12  . 

.82 

1.68 

3.50 

2.44 

.35 

.54 

.40 

.49 

T  and  14  . 

1.18 

1.57 

4.00 

2.28 

.42 

.46 

.20 

.99 

T  and  18  . 

1.47 

1.42 

2.80 

2.52 

.22 

.46 

1.00 

1.04 

T  and  24  . 

.83 

1.39 

.10 

1.46 

.07 

.41 

.30 

1.00 

M  and  12  . 

.22 

1.38 

1.50 

2.24 

.13 

.51 

.40 

.65 

M  and  14 . 

1.78 

1.46 

2.00 

2.10 

.20 

.47 

1.00 

.95 

M  and  18  . 

.87 

1.37 

.80 

2.80 

.00 

.51 

.20 

1.02 

M  and  24  . 

1.43 

1.04 

1.90 

1.63 

.15 

.58 

.50 

.94 

12  and  14  . 

2.00 

1.51 

.50 

2.79 

.07 

.42 

.60 

1.45 

12  and  18  . 

.65 

1.42 

.70 

1.81 

.13 

.52 

.60 

.82 

12  and  24  . 

1.65 

1.11 

3.40 

1.98 

.28 

.57 

.10 

.99 

Hand  18 . 

2.65* 

.99 

1.20 

2.02 

.20 

.37 

1.20 

1.00 

14  and  24  . 

.35 

1.61 

3.90* 

1.55 

.35 

.54 

.50 

1.33 

18  and  24  . 

2.30* 

.72 

2.70 

1.67 

.15 

.44 

.70 

1.25 

*  Exceeds  2.576  times  the  standard  error. 


tween  14-  and  24-point  type  is  significant,  but  not  that  between 
12-  and  24-point  type,  nor  that  between  12-  and  18-point  type. 
We  are  probably  safe  in  assuming,  therefore,  that  no  real  differ¬ 
ences  among  the  type  variations  are  indicated  by  this  analysis. 

Whereas  the  foregoing  analysis  yielded  no  results  justifying 
any  conclusion  that  one  type  is  more  readable  than  another  for 
either  myopes  or  hyperopes,  some  differences  between  the  mean 
scores  for  the  two  groups  are  suggestive.  It  will  be  noted  in 
Table  7,  for  example,  that  there  is  a  consistent  difference  be¬ 
tween  the  means  of  the  speed  scores  for  the  myopes  and  hy¬ 
peropes.  Fcr  the  eye-blink  scores,  however,  the  differences  are 
not  so  consistent.  Here  the  blink  rate  of  the  myopes  exceeded 
that  of  the  hyperopes  ^or  all  but  the  smallest  type  sizes.  This 
difference  was  greatest  for  typing  (which  approximated  the  24- 
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point  type  in  size)  and  24-point  Caslon,  the  largest  of  the  type 
sizes.  For  the  smallest  type  sizes,  the  hyperopes  blinked  more 
frequently  than  the  myopes,  the  greatest  difference  occurring 
in  the  case  of  14-point  type. 

Relationship  between  the  criteria 

As  one  of  the  purposes  of  the  study  was  to  appraise  the  rela¬ 
tionship  obtaining  between  the  criteria  of  reading  speed  and 
eye-blink  frequency,  the  data  were  examined  for  what  they 
could  reveal  in  that  respect.  For  each  subject,  the  highest  and 
lowest  reading  speed  scores  were  ascertained,  and  compared 
with  the  corresponding  eye-blink  scores.  If  the  criteria  were 
related,  we  should  expect  the  great  majority  of  cases  to  be 
consistent  in  showing  either  a  direct  or  an  inverse  relationship. 
Actually,  a  direct  relationship  was  revealed  in  32  cases,  an  in¬ 
verse  relationship  in  34  cases,  with  6  cases  showing  no  relation¬ 
ship.  In  the  experimental  group,  therefore,  the  criteria  showed 
little  or  no  relationship. 


SUMMARY 

The  experiment  was  set  up  for  a  five-component  analysis  of 
variance.  The  tests  were  administered  in  a  Latin  Square  pattern, 
wherein  each  child  took  all  six  forms  and  read  all  six  type  vari¬ 
ations,  but  no  two  children  in  each  of  the  two  large  groups  of 
36  pupils  took  the  same  combinations  of  test  form,  type  vari¬ 
ation,  and  test  order.  The  results  were  analyzed  for  the  signifi¬ 
cance  of  the  statistical  variation  induced  by  each  of  the  compo¬ 
nents  analyzed,  with  particular  attention  to  the  significance  of 
the  variance  assignable  to  type  variation.  The  F-value  for  type 
variation  was  insignificant  at  the  .05  level  for  both  criteria:  read¬ 
ing  speed  and  eye-blink  frequency.  For  the  speed  scores,  the 
variance  attributable  to  individual  differences,  test  form,  and 
order  of  testing  were  all  highly  significant.  Only  variance  in¬ 
duced  by  individual  differences  was  significant  for  the  eye-blink 
scores. 
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The  scores  of  a  group  of  40  myopes  and  a  group  of  10  hy- 
peropes  were  then  analyzed  to  ascertain  whether  there  was  a 
difference  between  the  means  of  the  type  size  scores,  a  differ¬ 
ence  which  might  be  attributed  to  the  type  of  visual  impairment. 
For  each  of  these  two  groups  the  differences  between  the  mean 
scores  of  the  various  type  sizes  were  tested  for  significance.  The 
results  were  such  as  to  indicate  that  no  real  differences  existed. 

An  effort  was  also  made  to  test  the  degree  of  relationship  be¬ 
tween  the  two  criteria:  reading  speed  and  eye-blink  frequency. 
The  results  indicated  that  practically  no  relationship  obtains  in 
this  respect  with  the  group  of  subjects  studied. 


V 


ANALYSIS  AND  INTERPRETATION 

OF  THE  RESULTS 

The  results  of  the  statistical  analysis  of  the  data  may  be  sum¬ 
marized  as  follows: 

1.  Analysis  of  variance  revealed  no  significant  differences  in 
speed  with  which  the  different  type  variations  were  read. 

2.  Analysis  of  variance  revealed  no  significant  differences  in 
the  frequency  of  eye-blink  for  each  of  the  type  variations. 

3.  The  speed  criterion  revealed  significantly  large  differences 
among  the  test  forms  and  the  factor  of  testing  order.  Eye- 
blink  scores  did  not  reflect  these  differences. 

4.  There  was  no  statistically  reliable  difference  among  the 
means  of  the  type  variation  scores  in  the  case  of  either  the 
myopic  or  the  hyperopic  subjects.  Nevertheless,  sugges¬ 
tively  patterned  differences  between  these  groups  seemed 
to  favor  the  larger  types  for  the  hyperopes. 

5.  There  appeared  to  be  no  relationship  between  the  criteria 
employed. 

6.  For  adult  observers  of  normal  or  near-normal  vision, 
24-point  type  is  the  most  visible  of  the  types  studied  (as 
measured  by  the  Luckiesh-Moss  Visibility  Meter),  fol¬ 
lowed  in  order  by:  Ampli-type,  typed  (dark);  18-point; 
Ampli-type,  typed  (light);  Ampli-type,  Mimeographed; 
14-point;  and  12-point. 

POSSIBLE  INTERPRETATIONS  OF  THE  RESULTS 

The  complete  absence  of  significant  differences  among  the 
type  variations  with  respect  to  reading  speed  and  eye-blink 
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frequency  is,  at  first,  rather  surprising.  On  the  basis  of  previous 
studies  and  common  observation,  we  should  be  led  to  expect  a 
real  difference  between  at  least  the  extremes  of  12 -point  and 
24-point  type.  The  absence  of  positive  results  is  subject  to  at 
least  three  interpretations: 

1.  For  visually  handicapped  children,  reading  for  brief  pe¬ 
riods,  there  are  no  real  differences  in  the  readability  of  the 
type  variations  studied. 

2.  For  visually  handicapped  children  there  are  real  differ¬ 
ences  in  the  readability  of  the  type  variations  studied, 
but  the  criteria  used  did  not  measure  them  adequately. 

3.  For  visually  handicapped  children  there  are  real  differ¬ 
ences  in  the  readability  of  the  type  variations  studied,  but 
they  were  obscured  by  error  for  which  the  experimental 
design  provided  no  adequate  estimate,  and  which  swelled 
the  error  variance. 

EVALUATION  OF  POSSIBLE  INTERPRETATIONS 

It  is  impossible,  of  course,  to  say  which  of  these  three  inter¬ 
pretations  of  the  data  is  the  correct  one.  Perhaps  the  absence  of 
positive  results  cannot  be  attributed  to  any  one  factor  alone. 
Nevertheless,  the  data  should  be  examined  to  determine  whether 
one  interpretation  seems  more  plausible  than  another. 

Interpretation  1  runs  counter  to  popular  belief  for  a  number 
of  reasons.  For  example,  there  are  many  investigations  report¬ 
ing  differences  in  the  readability  of  less  widely  differing  type 
sizes  than  those  studied  in  this  experiment.  It  is  true  that  almost 
all  such  studies  are  concerned  with  smaller  type  sizes  and  with 
subjects  of  normal  vision;  hence,  they  are  really  not  comparable 
with  this  investigation.  Nevertheless,  they  do  predispose  one 
to  expect  differences  in  readability  among  types  differing  as 
much  as  those  examined  in  the  present  study. 

The  fact  that  school  children  show  a  strong  preference  for 
books  with  relatively  larger  types  lends  credence  to  this  view¬ 
point.  It  can  also  be  pointed  out  that  the  practice  of  using  large 
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type  in  the  sight-saving  classes  is  based  on  more  than  Irwin’s 
findings/  The  teachers  of  these  classes  relate  that  type  sizes  less 
than  18  points  in  size  cause  a  number  of  pupils  to  squint  and 
assume  unnatural  reading  positions,  presumably  because  such 
types  are  difficult  for  them  to  read.  Some  pupils  actually  com¬ 
plain  of  discomfort  from  reading  types  smaller  than  18  points  in 
size.  The  visibility  ratings  in  Table  1  (p.  27)  also  support  this 
belief. 

Nevertheless,  the  results  of  this  study  are  such  as  to  render 
interpretation  1  a  real  possibility.  In  the  variances  of  the  reading 
speed  scores,  for  example,  it  will  be  noted  that  the  statistical  tool 
was  precise  enough,  and  the  criterion  sufficiently  sensitive,  to 
reveal  significantly  large  variances  for  test  form  and  order  of 
testing.  It  is  reasonable  to  assume  that  had  there  existed  com¬ 
parable  differences  with  respect  to  type  variation  they  would 
have  been  made  evident.  Apparently,  therefore,  for  the  subjects 
tested  in  this  study,  there  is  not  as  great  a  difference  in  the  reada¬ 
bility  of  the  types  investigated  as  seems  to  be  commonly  assumed 
— at  least  when  reading  for  brief  periods  of  time. 

Validity  and  reliability  of  the  criteria 

Turning  our  attention  to  interpretation  2,  we  are  faced  with 
the  question  of  the  adequacy  of  the  criteria.  In  Chapter  II  it  was 
indicated  that  Luckiesh  and  Moss  consider  reading  speed  to  be 
too  insensitive  to  changes  in  the  severity  of  the  visual  task  to  be 
of  much  value  as  a  criterion,  particularly  in  the  brief  span  of  5 
minutes.^  On  the  other  hand,  the  criterion  of  eye-blink  fre¬ 
quency,  endorsed  virtually  without  qualification  by  Luckiesh 
and  Moss,  has  not,  to  the  knowledge  of  the  writer,  been  vali¬ 
dated  by  any  investigators  except  these  two.  This  takes  us  back 
to  the  problem  at  the  beginning  of  Chapter  II;  namely,  that  there 

^  See  page  3  for  Irwin’s  conclusions. 

^  M.  Luckiesh  and  F.  Moss,  Visibility  and  Readability  of  Print  on  White  and 
Tinted  Papers,  p.  3.  Publication  271,  National  Society  for  the  Prevention  of 
Blindness,  Inc.,  New  York.  (Reprinted  from  The  Sight-Saving  Review,  Vol.  8, 
No.  2,  June,  1938.) 
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is  no  criterion  of  readability,  or  of  visual  effort  or  fatigue,  which 
is  universally  agreed  upon  as  being  valid  and  reliable. 

The  validity  of  the  criterion  of  reading  speed  is,  as  far  as  the 
writer  is  aware,  unquestioned;  its  severest  critics  grant  that  it 
may  measure  (albeit  insensitively)  readability.  The  fact  that  it 
is  insensitive  to  relatively  small  changes  in  readability  is  its  great¬ 
est  drawback.  That  is  admitted  by  its  warmest  proponents.  Its 
validity,  however,  has  been  attested  to  by  the  number  of  investi¬ 
gators  who  have  reported  it  to  reflect  differences  in  readability, 
when  the  conditions  were  such  as  to  make  such  differences 
axiomatic. 

Eye-blink  frequency,  on  the  other  hand,  has  not  been  as 
widely  validated.  One  authority,  indeed,  questions  the  validity 
of  the  criterion  on  the  grounds  that  Luckiesh  and  Moss  misin¬ 
terpret  the  Ponder  and  Kennedy  conclusions  on  muscle  tension.^ 
Whereas  there  seems  to  be  no  reported  evidence  disputing  this 
criterion,  there  is  also  no  confirming  evidence  from  sources 
other  than  the  Luckiesh  and  Moss  reports.  It  is  notable  that  in 
the  present  study  eye-blink  frequency  was  revealed  to  bear  little 
relationship  to  reading  speed  (the  latter  accepted  as  a  valid  cri¬ 
terion  of  readability)  when  measured  over  periods  as  brief  as 
five  minutes.  It  may  be,  therefore,  that  there  are  real  differ¬ 
ences  in  the  readability  of  some  of  the  types  studied,  but  that 
the  speed  criterion  was  too  crude  to  reveal  them,  and  that  the 
eye-blink  criterion  was  not  a  valid  measure  of  them. 

Possible  effects  of  undetermined  systematic  error 

As  analysis  of  variance  is  probably  the  best  statistical  method 
yet  devised  for  estimating  the  amount  of  error  in  an  experiment 
of  this  kind,  interpretation  3  would  seem  unlikely.  There  are, 
however,  sources  of  error  which  were  not  quantitatively  isolated 
in  the  experiment,  and  these  may  have  swelled  the  residual  vari¬ 
ance  (used  in  obtaining  the  F-value)  to  such  an  extent  that  it 

®  M.  A.  Tinker,  “Illumination  Intensities  for  Reading,”  American  Journal  of 
Ophthalmology,  18:1036-38,  1935. 
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obscured  any  real  differences  in  the  data.  One  of  these  sources 
is  the  possible  unreliability  of  a  five-minute  sampling  period. 
Whereas  other  investigators  have  reported  positive  results  in 
experiments  using  even  shorter  sampling  periods,  their  other 
conditions  were  very  different  from  those  of  this  experiment 
(adult  subjects,  normal  vision,  etc.).  It  is  possible,  therefore, 
that  the  sampling  periods  were  not  long  enough  to  provide  ade¬ 
quate  reliability. 

Another  possible  source  of  error  is  the  variation  within  and 
among  the  observers  who  counted  eye-blink — the  “unreliabil¬ 
ity”  of  the  observers,  so  to  speak.  Some  observers  consistently 
found  lower  eye-blink  totals  than  others,  as  is  revealed  in  Table 
9.  Unfortunately,  this  source  of  error  was  not  randomized,  as 
its  magnitude  was  unforeseen.  The  differences  among  the  ob¬ 
servers  in  this  respect,  however,  does  suggest  that  eye-blink 
frequency  count  may  not  be  an  objective  enough  measure  of 
visual  effort  expended  when  human  observation  is  the  means 
of  determining  it.  Furthermore,  “eye-blink”  itself  is  in  need  of 
definition,  and  even  when  defined  is  necessarily  individually  in¬ 
terpreted.  What  is  an  eye-blink?  Must  the  upper  lid  come  in 
contact  with  the  lower  lid?  If  not,  how  far  down  must  it  come 
to  be  counted  as  a  definite  blink?  Half-way?  Three-fourths  of 
the  way?  Whatever  standard  is  set,  it  is  a  practical  impossibility 
for  the  unassisted  human  visual  mechanism  to  determine  whether 

Table  9 

NUMBER  OF  EYE-BLINKS  COUNTED  BY  EACH 
OBSERVER,  WITH  MEANS 

Total  Eye-Blinks 


Observer  Counted  Mean 


1  .  784  10.88 

2  .  823  11.43 

3  .  1203  16.71 

4  .  1336  18.56 

5  .  1278  17.75 

6  .  1396  19.39 
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the  lids  did  come  in  contact  with  each  other,  or  whether  the 
upper  lid  came  half-way  down,  or  only  seven-sixteenths  of  the 
way.  There  is,  therefore,  a  real  possibility  that  this  criterion  is 
unreliable  when  the  count  is  made  by  human  beings  unassisted 
by  mechanical  devices.  An  experiment  in  which  pairs  of  investi¬ 
gators  simultaneously  counted  the  blinks  of  the  same  subjects 
might  be  revealing  in  this  respect. 

One  other  result  may  be  interpreted  interestingly.  When 
scores  of  hyperopes  and  myopes  were  separated  out,  and  their 
means  tested  for  significant  differences,  the  means  of  the  eye- 
blink  scores  (Table  7,  p.  50)  seemed  to  suggest  that  the  hy¬ 
peropes  blinked  appreciably  more  than  the  myopes  on  the  12- 
and  14-point  types,  and  appreciably  less  on  the  larger  types. 
The  greatest  differences  in  this  latter  respect  occurred  in  the 
case  of  typing  and  24-point  Caslon.  These  differences  suggest 
that  the  larger  types  may  be  more  readable  for  the  hyperopes. 
Less  strongly,  they  point  to  the  possibility  of  the  smaller  types 
being  the  more  readable  for  the  myopes.  The  differences  are 
not  large  enough,  however,  to  approach  conclusiveness. 

As  has  been  pointed  out,  there  was  a  heavy  preponderance  of 
myopes  in  the  experimental  population,  and  relatively  few  hy¬ 
peropes.  It  is  entirely  possible  that,  if  a  large  enough  group  of 
hyperopes  were  included  in  a  study  of  this  kind,  it  might  be 
found  that  for  them  there  are  real  differences  in  the  readability 
of  the  experimental  types.  It  is  also  possible  that  real  differences 
between  them  and  the  myopes,  with  respect  to  the  readability  of 
the  experimental  types,  might  be  revealed. 

Thus,  it  is  evident  that  there  is  a  possibility  that  uncontrolled 
factors  operated  to  obscure  what  differences  the  criteria  might 
otherwise  have  revealed.  Nevertheless,  this  does  not  seem  to  be 
a  strong  possibility  in  the  face  of  the  fact  that:  ( 1 )  a  large  part 
of  the  total  variance  was  accounted  for  by  other  systematic  fac¬ 
tors;  (2)  the  residual  (error)  variance  was  relatively  small;  and 
(3)  the  variance  ascribable  to  type  form  was  even  less  than  that 
predicted  by  the  error  variance. 
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SUMMARY  AND  CONCLUSIONS 

THE  PROBLEM 

There  is  a  need  for  objective  evidence  on  the  readability  of  var¬ 
ious  type  sizes  for  children  whose  vision  is  severely  impaired. 
This  study  was  undertaken  in  an  attempt  to  supply  data  on  the 
relative  readability  of  several  type  sizes  and  variations.  For  the 
purposes  of  this  study,  readability  is  defined  as  the  extent  to 
which  a  given  type  size  or  form  lends  itself  to  being  read  with 
absence  of  visual  effort — with  least  cost  to  the  human  visual 
mechanism — when  comprehension  is  held  relatively  constant. 
The  types  studied  were: 

1.  12 -point  Caslon  No.  3,  Linotype 

2.  14-point  Caslon  No.  3,  Linotype 

3.  18-point  Caslon  Bold  No.  79,  Monotype 

4.  24-point  Caslon  Bold  No.  79,  Monotype 

5.  Ampli-Type,  typed 

6.  Ampli-Type,  Mimeographed 

THE  CRITERIA 

The  criteria  employed  to  measure  readability  were  two:  fre¬ 
quency  of  involuntary  eye-blink  (which  has  been  used  exten¬ 
sively,  and  almost  exclusively,  in  the  readability  studies  of 
Luckiesh  and  Moss),  and  speed  of  reading.  The  relative  visibility 
of  each  of  the  types  studied  was  also  measured,  using  the’ 
Luckiesh-Moss  Visibility  Meter. 

THE  TESTS 

The  reading  tests  used  were  made  up  of  items  from  the  Gates 

speed  of  reading  tests  of  the  Gates  Reading  Survey  and  the 
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Modern  School  Achievement  Test,  plus  72  items,  closely  simi¬ 
lar  to  the  items  in  the  above-mentioned  tests,  written  by  the 
author  of  the  study.  The  entire  300  items  were  then  randomly 
distributed  so  as  to  make  up  six  test  forms  of  50  items  each.  The 
tests  are  of  about  third  grade  difficulty  level. 

Each  test  form  was  printed  in  all  six  of  the  type  variations 
under  study.  As  there  were  six  test  forms  and  six  type  varia¬ 
tions,  there  resulted  thirty-six  different  combinations  of  test 
form  and  type  variation. 

The  typographical  conditions  of  the  tests  were  determined 
by  the  advice  of  several  experts  in  the  fields  of  typography  and 
book  designing  (see  pages  24  to  28).  All  tests  were  printed  on 
paper  8.5  inches  by  11.0  inches. 

PAPER 

The  paper  upon  which  the  tests  were  printed  was  Warren’s 
Olde  Style  Wove,  India,  Substance  60.  It  is  a  light  cream-buff 
colored  book  paper  of  high  quality,  with  a  dull,  light-diffusing 
surface.  Its  characteristics  with  respect  to  glare,  opacity,  smooth¬ 
ness,  and  color  were  ascertained  quantitatively,  and  are  specified 
on  pages  29-30. 


THE  SUBJECTS 

Seventy-two  pupils  from  the  sight-saving  classes  of  New 
York  City  public  schools  were  the  subjects.  They  ranged  in 
age  from  9  to  14  years,  and  in  school  grade  placement  from  the 
fourth  to  eighth  year.  The  great  majority  were  from  10  to  12 
years  old,  and  were  in  grades  5  and  6.  Twenty-nine  were  boys, 
and  forty-three  were  girls. 

Of  the  total,  fifty-two  were  myopic,  ten  were  hyperopic, 
and  the  other  ten  were  distributed  over  seven  other  types  of 
visual  anomaly.  Many  of  the  myopic  and  hyperopic  cases  were 
also  afflicted  with  astigmatism,  and  a  few  had  other  complica¬ 
tions. 

According  to  the  judgment  of  the  classroom  teachers  of  the 
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subjects,  all  the  pupils  tested  had  reading  abilities  above  the 
third  grade  level;  the  great  majority  were  judged  to  be  of  good 
fifth  grade  level  in  reading. 

ADMINISTRATION  OF  THE  TESTS 

For  the  test  administration,  the  pupils  were  divided  into 
twelve  groups  of  six  pupils  each.  The  tests  were  administered 
in  a  Latin  Square  pattern  in  such  a  manner  that  each  pupil  took 
six  test  forms,  each  printed  in  a  different  type  variation,  but  no 
two  children  in  each  group  of  thirty-six  took  the  same  six  com¬ 
binations  in  the  same  order. 

The  tests  were  administered  to  six  pupils  at  a  time,  with  no 
other  pupils  present.  An  adult  observer  (one  for  each  subject), 
seated  beside  and  slightly  behind  the  subject,  counted  the  num¬ 
ber  of  eye-blinks  during  each  five  minute  period  in  which  a  test 
was  administered.  During  the  test  administration  each  desk  was 
receiving  from  27  to  35  foot-candles  of  light  (determined  with 
a  General  Electric  Light  Meter).  At  the  end  of  each  five- 
minute  test  period,  the  number  of  eye-blinks  was  noted  on  the 
title  page  of  the  test  by  the  observer.  There  were  three-minute 
rest  periods  between  all  but  the  third  and  fourth  tests,  and  be¬ 
tween  these  two  was  a  five-minute  relaxation  period.  The  eye- 
blinks  were  observed  and  counted  mentally. 

METHODS  OF  ANALYSIS  OF  THE  DATA 

The  variance  of  the  data  was  analyzed  by  means  of  a  five- 
component  analysis  of  variance.  The  sources  of  variance  which 
were  isolated  were:  variance  among  individuals;  variance  among 
test  forms;  variance  arising  from  test  order;  variance  attributable 
to  differences  in  type;  and  residual  variance.  This  analysis  was 
made  for  each  of  the  two  criteria,  reading  speed  and  eye-blink 
frequency.  The  F-test  of  significance  was  employed  to  evalu¬ 
ate  the  degree  to  which  each  systematic  variance  was  signifi¬ 
cantly  larger  than  the  error  variance. 

In  another  treatment  of  the  data,  two  groups  were  selected 
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from  the  total  number  of  subjects:  a  group  of  forty  myopes  and 
a  group  of  ten  hyperopes.  Within  each  of  these  two  groups 
the  means  of  the  criterion  scores  for  each  type  variation  were 
computed,  and  the  significance  of  the  difference  between  these 
means  was  ascertained.  This  analysis  was  made  to  determine 
whether  there  was  a  statistically  reliable  difference  among  the 
means  of  the  criterion  scores  for  myopic  and  hyperopic  chil¬ 
dren,  a  difference  which  might  have  been  obscured  when  the 
data  of  both  groups  were  lumped  together  in  one  treatment. 

The  data  were  also  examined  to  discover  whether  there  was 
apparent  a  relationship  between  the  criteria  (reading  speed  and 
eye-blink  frequency),  which  we  should  expect  if  they  both 
measure  the  same  thing.  For  each  subject,  the  relationship  be¬ 
tween  the  scores  for  each  of  the  criteria  was  ascertained,  and 
the  number  of  times  in  which  there  was  a  direct  relationship,  an 
inverse  relationship,  and  no  relationship  was  determined. 

The  type  variations  were  also  tested  for  visibility  by  adult 
observers  of  normal  or  near-normal  vision,  using  the  Luckiesh- 
Moss  Visibility  Meter. 

RESULTS 

The  results  of  the  foregoing  analyses  of  the  data  were  as  fol¬ 
lows: 

1.  The  analysis  of  variance  revealed  no  significant  differences 
in  the  speed  with  which  the  different  type  variations  were 
read. 

2.  The  analysis  of  variance  revealed  no  significant  differences 
in  the  frequency  of  eye-blink  for  each  of  the  type  vari¬ 
ations. 

3.  The  speed  criterion  revealed  significantly  large  differences 
among  the  test  forms  and  the  position  in  order  of  testing 
which  did  not  show  up  in  the  eye-blink  scores. 

4.  There  was  no  statistically  reliable  difference  among  the 
means  of  the  type  variation  scores  in  the  case  of  either  the 
myopic  or  the  hyperopic  subjects.  Nevertheless,  sugges- 
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tively  patterned  differences  between  these  groups  seemed 
to  favor  the  larger  types  for  the  hyperopes. 

5.  There  appeared  to  be  no  relationship  between  the  criteria 
employed. 

6.  For  adult  observers  of  normal  or  near-normal  vision, 
24-point  type  is  the  most  visible  of  the  types  studied  (as 
measured  by  the  Luckiesh-Moss  Visibility  Meter),  fol¬ 
lowed  in  order  by:  Ampli-type,  typed  (dark);  18-point; 
Ampli-type,  typed  (light);  Ampli-type,  Mimeographed; 
14-point;  and  12 -point. 

INTERPRETATION  OF  THE  RESULTS 

Three  general  interpretations  of  the  results  seem  to  be  pos¬ 
sible: 

1.  For  visually  handicapped  children,  reading  for  brief  pe¬ 
riods,  there  are  no  real  differences  in  the  readability  of  the 
types  studied. 

2.  For  visually  handicapped  children  there  are  real  differences 
in  the  readability  of  the  type  variations  studied,  but  the 
criteria  used  did  not  measure  them  adequately. 

3.  For  visually  handicapped  children  there  are  real  differences 
in  the  readability  of  the  type  variations  studied,  but  they 
were  obscured  by  error  for  which  the  experimental  design 
provided  no  adequate  estimate,  and  which  swelled  the 
error  variance. 

Examination  of  the  results  leads  to  the  belief  that  interpreta¬ 
tion  3  is  the  least  likely.  On  the  other  hand,  there  is  evidence 
that  the  criteria — particularly  the  criterion  of  eye-blink  fre¬ 
quency — did  not  measure  adequately  what  differences  there 
may  have  been.  This  may  have  been  because  of  the  brevity  of 
the  sampling  periods.  A  strict  interpretation  of  the  results  of 
the  statistical  analysis  would  incline  one  to  state  that,  for  the 
visually  handicapped  children  studied,  there  is  no  real  difference 
in  the  readability  of  the  type  variations  tested.  In  spite  of  the 
lack  of  statistically  significant  differences,  however,  there  is  a 
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possibility  that  for  the  subjects  used  in  this  study  there  are  dif¬ 
ferences  in  the  readability  of  the  type  forms  investigated  but 
they  were  obscured  by  the  operation  of  some  other  factor  or 
combination  of  factors. 


CONCLUSIONS 

The  results  of  this  study  will  not  permit  the  conclusion  that 
any  one  of  the  experimental  types  is  preferable  to  any  of  the 
others  for  use  in  sight-saving  classes.  Certain  other  conclusions 
seem  tenable,  however: 

1.  For  the  subjects  used  in  this  study,  reading  for  five-minute 
periods  under  from  27  to  35  foot-candles  of  illumination, 
there  is  not  as  great  a  difference  in  the  readability  of  the 
type  forms  studied  as  has  been  commonly  supposed. 

2.  There  is  a  need  for  further  evaluation  of  the  validity  and 
reliability  of  eye-blink  frequency  as  a  criterion  of  reada¬ 
bility.^ 

3.  Studies  of  this  type  should  employ  sampling  periods  longer 
than  five  minutes,  or  space  a  number  of  brief  samples  from 
a  longer  total  working  period. 

4.  Studies  of  readability  (as  herein  defined)  cannot  be  con¬ 
clusive  until  and  unless  some  more  valid  and  sensitive  cri¬ 
terion  of  visual  fatigue,  or  of  visual  effort  expended,  is  de¬ 
veloped. 

^Recent  studies  conducted  at  Harvard  University  yield  additional  evidence 
to  support  the  belief  that  eye-blink  frequency  may  be  a  poor  criterion  for  the 
measurement  of  visual  effort  expended,  or  of  visual  fatigue.  The  report  of 
these  studies  was  received  too  late  to  be  incorporated  into  the  body  of  this 
manuscript.  See  R.  A.  McFarland,  A.  H.  Holway,  and  L.  M.  Hurvich,  Studies 
of  Visual  Fatigue,  pp.  71-86.  Harvard  University  Press,  Cambridge,  Mass.,  1942. 
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